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Abstract 


A two-step classification algorithm for processing multispectral scanner 
data has been developed and tested. The algorithm is carried out by two 
separate programs called GLUSTX and GROUPX. The program CLUSTX is a single 
pass clustering algorithm that assigns each pixel, based on its spectral 
signature, to a particular cluster. The output of the program CLUSTX is a 
cluster tape in which a single integer is associated with each pixel. This 
integer is the cluster number to which the pixel has been assigned by the 
program. The cluster tape is used as the input to the classification program 
GROUPX. Ground truth Information is used in GROUPX to classify each cluster 
using an iterative method of potentials. Once the clusters have been assigned 
to classes the cluster tape is read pixel-by-plxel and an output tape is 
produced in which each pixel is assigned to Its proper class. The classifi- 
cation algorithm can be operated in a hierarchical manner in which each ground 
truth datura is classified at various levels in a classification tree. In 
addition to the digital classification programs, a method of using correlation 
clustering to process multispectral scanner data in real time by means of an 
interactive color video display is also described. 
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1 . 


Su mmary and Overview 


The research undertaken under this contract had^iis its goal the development 
and evaluation of various correlation techniques which might be useful in the 
processing of multispectral scanner data. This study is an outgrowth of work 
that was initially undertaken when the principal investigator was on sabbatical 
leave at the Johnson Space Center during the 1972-73 academic year. 

At that time the principal investigator developed a single-pass clustering 
1 

algorithm called CLUSTD that could be used as a nonsupervised classifier. In 

addition, the possibilities of using coherent optical methods in the processing 

2 

of multispectral scanner data were also studied. Considerable progress has 
been made under the present contract in clarifying the potential role of these 
techniques and significant advances In developing and evaluating these methods 
have been achieved. 

The major accomplishments of the current research effort include the 
following ; 

1) The overall digital processing of multispectral scanner data 
has been separated into two separate tasks. The first is to 
associate every pixel with a particular cluster by using a 
single-pass correlation clustering algorithm. The clusters are 
made small enough so that (nearly) all pixels in a given cluster 
will have very similar spectral signatures and therefore can be 
associated with the same class. The second task is to classify 
each cluster using ground truth information and thus, by 
association, to classify each pixel in the flight line. This 
separation of the processing tasks means that only a relatively 
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few spectral signatures need to be classified by the classifier 
(usually less than 200, corresponding to the spectral signatures 
associated with each cluster). As a result very powerful, non- 
linear, nonparametric classifiers can be used to classify these 
clusters, A more detailed description of this overall processing 
method is given in Section 3. 

2) Two new single-pass correlation clustering algorithms have been 
developed. These algorithms have replaced the original method 
used in CLUSTD that was based on a transformation of the spectral 
signature into a binary signature in which the elements were either 
+1 or ~i. The improved algorithms accomplish the same task without 
the need for this transformation. (This transformation was originally 
invented for an optical implementation in which it is required.) 

These single-pass clustering algorithms are grouped under the 
general name of CLUSTX and are described in more detail in 

Section 4. 

3) The single-pass clustering algorithms CLUSTX have been extensively 
studied. Ihe goal is to generate enough clusters so that all of 
the pixels in a given cluster will belong to a single class. This 
can obviously be achieved in the limit of one cluster per pixel. 

We have found, however, that with fewer than 200 clusters, over 
99% of all pixels in a given cluster will, oh the average, belong 
to one class . The best results are achieved when the physical 
separation of pixels associated with the same cluster is not allowed 
to become too great. 


2 


OEIGDtAB PAGBTS 
OF POOE QUALETF 



A new, nonparametric method of classifying the clusters based on 
an iterative method of potentials has been developed. This 
algorithm is described in more detail in Section 5 . In one version 
of the program, the training data for the classifier is taken to be 
the clus ters that have been assigned to classes based on the 
costiaatrix (i.e., by simply counting ground truth pixels in each 
cluster). This works well when large quantities of ground truth 
are available. For example, we have achieved an overall classifi- 
cation accuracy of approximately 94% when applying this method to 
the 12-channel aircraft scanner data of the C-1 flight line. The 
program GROUPX has been further improved by modifying it in such a 
way that the classifier based on the method of potentials can be 
trained directly from the ground truth pixels. This means that 
many fewer groijnd truth pixels are needed in order to effectively 
train the classifier. 

A new hierarchical classifier called CHIMP (for Classification 
Hierarchy using an Iterative Method of Potentials) has been 
developed. This classifier allows ground truth information 
to be stored in the form of a classification tree with various 
levels of detail. For example, the class com could be stored 
simultaneously as land, agricultural land, cultivated agricultural 
land, and com. Classification of all pixels can occur at any 
level. In addition, ground truth information can be entered at 
any level and used for classifying all higher levels. For example, 
a pixel may be known to be forest but the particular type of 
trees may be unknown. This pixel could be used as ground truth 



for classifying, for example at a forest, urban, agriculture 
level. This type of flexibility could greatly reduce the cost of 
acquiring ground truth by making maximum use of such things as 
aerial photographs. That is, all information that is known, at 
whatever level of detail can be handled by the classifier. A 
description of this new classifier is described in Section 6. 

New methods of processing large quantities of multispectral scanner 
data have been studied. This was the original motivation for 
investigating the possibilities of applying various optical techniques. 
In order to make a major advance in the use of multispectral scanner 
data the goal should be to develop a real-time processing system 
in which the human operator can interactively control the regions 
of feature space that are being observed. Such a real-time pro- 
cessor will require considerable parallel processing capabilities 
and optical methods seem to offer a possible choice. However, 
after an extensive study of the current optical processing 
technology it was concluded that the development of a real-time, 
interactive, color processing system is beyond the present 
state-of-the-art. Alternate technologies were then investigated 
and the preliminary design of a real-time processing system using 
a hybrid digital/analog system has been completed. This system, 
which could have a major impact on the usefulness and applications 
of multispectral scanner data, is described in Section 7. 
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Recommeadatlons 


As a result of the work done under this contract the following specific 
recommendations are made; 

A. Software 

1. It is recommended that the program CLUS‘3!X be made operational 
at WASA-JSC after the following improvements and modifications 
have been made. 

a) A preprocessing procedure should be included that will 
sample the data in order to determine the optimum window 
size (the threshold parameter) such that clusters are 
generated at an appropriate rate. 

b) A feature should be added that will start generating 
new clusters on a new file when the maximum number of 
clusters is reached or when a certain nr.mber of scan lines 
has been processed. This will minimize the problem of 
different classes that are widely separated on the ground 
but might have similar spectral signatures. In addition, 
it will allow an entire data tape to be processed at one 
time . 

c) The COSTMATRIX procedure should be provided as an option 

in CLTJSTX for evaluating the effectiveness of the clustering 
operation when ground truth information is available. This 
information should be stored on the cluster tape, 

d) The linear correlation measure and the rectangular 
correlation measure should be provided as alternate 
correlation measure options. 
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2, It is recommended that the program GROUPX be implemented at NASA-JSC 

after the following improvements and modifications have been made. 

a) An algorithm should be implemented that will automatically 
use all ground truth Information within the particular area 
corresponding to the data on the cluster tape. In addition 
ground truth from an area on either side of the region being 
processed should be used. With this modification multi-file 
cluster tapes can be processed with new ground truth information 
always being added from in front of the flight path while old 
ground truth corresponding to areas behind the flight path 

are being discarded, 

b) Modifications should be made that will allow the program to 

be compatible with multi-file cluster tapes. These modifications 
would produce multi-file output tapes, 

c) An option should he provided for classifying the clusters 
using either an Iterative Potential Function Method or a 
Gaussian Maximum Likelihood Method . 

d) The hierarchical classifier CHIMP that can classify at 
various levels of detail should he incorporated as an 
option in the program. 

e) An option that will produce a line printer classification 
map should be included. 

£) An option that will produce a PMIS-DAS tape output should 
he provided. 

g) The capability of input ing ground truth test data and pro- 
ducing an error matrix for testing the classification 
accuracy should be provided. 
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B . Hardware 

It is recommended that a prototype interactive color display system 
as described in Section 7 of this report be built and tested. The major 
parts of the system would include 

1. A high density magnetic disk assembly with 32 fixed head 
transducers , 

2, A tape drive and processor suitable for loading the fixed 
head refresh disk, 

3- A specially designed interactive analog processor incorporating 
high speed D/A converters, 

4. A color TV monitor. 
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3. Processing Multlspectral Scanner Data Using Correlation Clustering and 

Nonparametric Classification Techniques 

The classification algorithm developed under this contract is a two-step 
process carried out by two separate programs called CLUSTX and GROUPX. The 
functions of these two programs are illustrated in the block diagram of Fig. 1. 
The input data tape contains multlspectral scanner data in the ^orm of 8-bit 
integers representing, for each pixel, the reflectance measured in each of 
several spectral channels. Thus, associated with each pixel on the input data 
tape are NCHAN integers (ranging in value from 0 to 255) where NCHAN is the 
number of spectral channels. 

The program CLUSTX is a single pass clustering algorithm that assigns 
each pixel, based on its spectral signature, to one of NCLUST clusters. The 
maximum value of NCLUST is.MAXCLUST (typically MAXCLUST=200) . However, the 
actual value of NCLUST is variable and is determined bv two variable parameters 
in the program. The output of the program CLUSTX is a cluster tape in which a 
single integer is associated with each pixel. This integer is the cluster 
number to which the pixel has been assigned by the program. 

The clustering program CLUSTX can be considered to be a data reduction 
and preprocessing step in the classification algorithm. Thus, for example, 
whereas the original problem might be to classify each of say 40,000 pixels 
as one of 4 classes, CLUSTX reduces the problem to one of classifying each 
of a maximum of MAXCLUST clusters. The assumption is that enough clusters 
are chosen so that all pixels assigned to a particular cluster have very 
similar spectral signatures and thus belong to the same class. A spectral 
signature is associated with each cluster. This signature, is the average 
signature of all pixels that have been assigned to the cluster. A detailed 
description of the program CLUSTX is given in Section 4. 
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Flow Diagram for Processing Multispectral Scanner Data Using Correlation 
Clustering and Nonparametric Classification Techniques. 





The cluster tape which is the output of the program CLUSTX is the input 
to the classification program GROUPX. Ground truth information is used in 
GROUPX to classify each cluster as one of a small set of classes. Since the 
maximum number of possible clusters is 200 the number of items to be classified 
is relatively small. However, once the clusters have been assigned to classes 
the cluster tape is read pixel by pixel and an output tape is produced in 
which each pixel is assigned to its proper class. This output classification 
tape can then be used directly to produce classification maps, compute acreage 
of different classes, or test the accuracy of the classification method by 
comparing the results with additional ground truth. 

Ground truth infoirmation is used to train the classifier that will 
classify each pixel. This classifier creates nonlinear decision surfaces 
based on the method of potentials. Two types of training are possible. If 
the ground truth is limited then the spectral signatures from each pixel are 
used to construct the decision surfaces. On the other hand, if a large 
quantity of ground truth is available, then it can be used to produce a 
costraatrix giving the number of pixels in each cluster that belongs to each 
of the various classes. These numbers are used to estimate the a posteriori 
probabilities of a particular cluster belonging to a particular class. The 
cluster is then assigned to the class for which this a posteriori probability 
is a maximum. The clusters classified in this manner serve as the training 
data for constructing the decision surfaces using the potential functions. 

The remaining clusters are then classified using the method of potentials- A 
mote complete description of the method of potentials is given in Section 5. 
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The advantages of this classi f ication algorithm include the following 

1) The classification method is entirely nonparametric and 
thus avoids the errors that are inherent in estimating 
parameter vectors in parametric methods. This should 
lead to a more effective utilization of all of the infor- 
mation when data from a large number of channels is used. In 
particular, multimodal distributions of particular classes 
cause no problem. 

2) Changes in the spectral signature of a particular class 
along the flight line cause no problem as long as repre- 
sentative ground truth is available, since the result will 
simply be the generation of new clusters. These clusters 
will then be assigned to the proper class in GROTJPX. 

3) If new ground truth information is obtained only GROUPX 
needs to be run to produce a new output tape. 

4) The clustering can be done before the ground truth is 
obtained and the results of the clustering can be used 
as an aid in selecting ground truth areas. 
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. Data Reduction Usinc A Single Pass Correlation Clustering Algorithm 


The program CLUSTX is a single pass clustering algorithm that uses a 

correlation function as a similarity measure for assigning each pixel to a 

particular cluster. This correlation function is a measure of similarity 

between the spectral signature of a new pixel and the spectral signatures 

associated with previously generated clusters. Let x be the N-channel 

spectral signature associated with a particular pixel. That is» 

T f i) 

X = [x^, Let y he the average of the spectral signatures 

of all pixels that have previously been assigned to cluster number i. Let 
(i) 

- y. )bea weighting function associated with channel j whose 
1 3 J 

C i ^ 

value is a maximum at x, = y. and whose value becomes small as 

3 J 

[x^ “ y^ j increases. A possible example of the functions ~ ^ ^ 

for the case of 4-channel data is shown in Figure 2. 

The correlation function associated with the ith cluster is defined 


= I 4).(x. - 

j=l 

From the properties of the function c{) . it is clear that the maximum value of 

p(i) . 1 

C IS equal to 

(i) ^ 

C'‘ '' = I (j).(O) 

max 1 

and will occur when the spectral signature x is equal to the spectral 
signature y^^\ It is also clear that a large value of will occur 

when the spectral signatures x and y^^^ are similar, while a small value 
of will occur when x and y^^^ are dissimilar. Thus, can he 


used as a similarity measure to determine if the pixel with a spectral 
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Fig 2- An Example of Possible Weighting Fianctions 

for the Case of 4-Channel Data. 

j J 
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signature x should be assigned to the cluster whose average spectral 

( 1 ) 

signature is . The criterion used will be to assign x to cluster i 

if > C , where C , is a variable threshold level. In the interest of 

— mxn mxn 

f • \ 

efficiency the '^'s will be computed in the inverse order of cluster 

generation and the pixel will be assigned to the first cluster encountered 
(i) 

for which C > C . . If this condition does not hold for any of the 
— mxn 

clusters, then a new cluster is generated with the pixel as its first member. 

The algorithm thus consists of the following steps: 

1) Assign first pixel with spectral signature x to cluster 

( iTi 

number 1. Let y = x and set i = WCLUST =1. 

NEXT 2) Consider next pixel with spectral signature x. When pixels 
run out, STOP 
LOOP 3) If i > 1 

Then Compute C^^^ = E d).(x. - 

If C^^^ > C . 

— mxn 

Then assign pixel to cluster number i and update 

(i) 

cluster signature y . GO TO NEXT 
Else let i = i - 1 and GO TO LOOP 
Else create a new cluster by letting NCLUST = NCLUST +1, 
i = NCLUST, and setting y^^^ = x. GO TO NEXT. 

In practice this algorithm may be modified so that instead of checking 
all of the clusters only the NBACK most recently generated clusters are 
checked before a new cluster is generated. 

Two different versions of this clustering algorithm have been implemented. 
One uses the linear correlation weighting function shown as the third example 
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in Fig. 2. The second implementation uses the rectangular weighting function 
shovm as the second example in Fig. 2. Both implementations produce satis- 
factory clustering results. 
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5. 


Pattern Classification Using an Iterative Method of Potentials 


The goal of computer-aided pattern recognition is to automatically 

4-6 

classify objects into distinct classes or states of nature. If there 
are M such classes for a given problem and i-l» M represents the ith 
class, then let P(w^) be the a priori probability of an object belonging to 
class i. If this was the only information available then the best decision 
rule is to always guess that an object belongs to the class for which 

is a maximum. This rule will result in the minimum probability, of 

error . 

However , one normally has more information available with xjhich to 
make a decision. This information will be assumed to be in the form of a 
measurement or feature vector x where x = [x-,x , ...,x ]. The components 
of this vector represent measurements on the object to be classified. For 
example, in multispectral scanner data the components of x represent the 
reflectance in each of N different spectral channels. 

Having made an observation x the a posteriori probability P(dJ^|x) 
that the object belongs to class given that x was measured is given by 
Bayes rule 

p(x|iJlJ.) P(tl).) 

PK|x) = ^ (5-1) 

i » P(s) 

where p(xjw^) is the state conditional probability density of x and p(x) 

is the total probability density 

M 

p(x) = I • (5-2) 

The decision rule is nox*7 to assign an object to class i if 

P(fjJ. |x) > P( 0 ).jx) for all j r i- Points in the feature space of x for which 
i “ J ~ ~ 
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P(ui. |x) = PCoj.Jx) lie on a decision boundary which separates class regions 
in the feature space. The decision rule can be generalized by introducing 
a loss matrix L„ representing the loss associated with choosing when 
the actual class is A classifier that minimizes the total expected 

4 

loss is called a Bayes classifier. The effect of various loss functions is 
to shift the decision boundaries in feature space so as to give more or less 
weight to a given decision. 

It is common practice in statistical pattern recognition to assume 
that all classes have equal a priori probabilities arid that the loss 

matrix is equal to 0 if i = j (no loss for choosing the correct class) and 
is equal to 1 if i j (a unit loss for making a mistake) . Under these 
assumptions the Bayesian decision rule is to choose o)^ if P(o)^|x) > PCw^jx) 
or p(x|o),) > p(x|{i3.) for all j i. 

Alternatively, any monotonically increasing function of PCw^^jx) can 
be used as a discriminant function g^(x) . The decision rule is then to choose 


class 0). if g.(x) > g . (x) for all j 5^ i. The logarithm of P((jj,|x) is often 

4 

used as a discriminant function. 

In general, the state conditional probability densities p(x|a)^) are 
not known. One common practice is to assume that p(xjcD^) is a multivariate 
normal distribution and labeled training samples are used to compute maximum 
likelihood estimates of the mean vector and covariance matrix for each class. 

There are two major potential pitfalls to this approach. First of all, 
if the training data for a particular class is not really normally distributed 
then serious errors can occur . This is particularly true if the data is 
multimodal and precautions (such as applying preliminary clustering algorithms) 
have not been taken to discover this fact. Secondly, and possibly more 
serious, is the fact that the number of samples needed to obtain reasonably 
good estimates of the mean vector and covariance matrix increases dramati- 
cally as the number of features goes up. Thus, while one would expect that 
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adding new features to the measurement vector would increase class discrimi- 
nation it is a common practical result that classification performance often 
deteriorates as the number of features increases beyond a certain point. 

This phenomenon can usually be traced to the fact that there are not enough 
training samples to provide an accurate estimate of the probability density 
parameters. 

In order to overcome this problem of too many dimensions in the feature 

vector a wide variety of feature selection algorithms have been developed. 

The goal of these algorithms is to reduce the dimensions of the feature space 

while at the same time trying to maintain the best possible discrimination 

between classes. However, the class discrimination can never be as good as 

when all features are used. This situation has led to the search for non- 

parametric methods in which the problems associated with statistical pat’a- 

meter estimations would be alleviated. 

Nonparametric techniques have been used to estimate both the state 

conditional probability density p(x|oJ^)^ and the a posteriori probability 
8 

P(oj^)x). Alternatively, methods have been developed that determine the 
discriminant functions g.(x) directly from the labeled training samples. The 

^ A- 

most popular of these techniques are the linear discriminant functions which 

9 

divide the feature space into class regions by means of hyperplanes. The 
main problem with linear discriminant functions is that there are many classi- 
fication problems in which the classes may be separable with nonlinear 
discriminant functions but are not separable with linear discriminant 
function,^. 

The final goal of any of the classification schemes is to associate 
every region in feature space with a particular class (or a probability of 
belonging to a class) in such a way that the best possible classification 
accuracy is achieved in practice. 
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The classifier described in this section is a nonpararaetric classifier 

that produces nonlinear decision surfaces or discriminant functions by means 

of an iterative method that continually warps the decision surfaces in such 

a way that all labeled training samples remain correctly classified. When 

classifying an unknown object with a feature vector Xj the M discriminant 

functions g^(x)» i = 1,M, are computed and the object is assigned to the 

class 1 for which g . (x) > g - (x) for all j i. . 

t ~ 3 ■v* 

This classifier is related to a class of methods referred to as the 
method of potentlals.^’^^’^^ In all such methods an interpolating or potential 
function is associated with labeled sample points. The cumulative sums of 
such potential functions form the discriminant functions used for classi- 
fication. In the most common version of this method a potential function 

is added to the discriminant function only when a labeled samples is mis- 

12-14 

classified by the discriminant functions formed up to that point. This 

recursive algorithm for forming the discriminant functions is applied inter- 
atively until all labeled samples are classified correctly. 

The advantage of this method of potentials is that only those samples 
that are misclassif ied need to be stored to compute the discriminant functions. 
However, although all training samples are classified correctly there is no 
reason to believe that the resulting discriminant functions are related in 
any way to the a posteriori probabilities P(W.|x) and thus there is no 

X 

reason to believe that good classification results will occur with test data. 

The classifier described in this section uses a modified approach in 

, , 15, H 

which a potential function is associated with each labeled trainxng sample. 

This approach is similar to the use of Parzen windows for the estimation 
of probability densities,^ However, an iteration technique is used in which 
a positive weighting factor is applied each time a labeled sample is raisclas- 
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sified. In this way the resulting cumulative discriminant functions continu- 
ally warp themselves until all labeled training samples are correctly classified. 

Although this method has been recognized as a very general and powerful 

classification technique, it has been criticized in the past for its compu- 

16 

rational problems and excessive storage requirement. However, these pro- 
blems have been largely overcome in the classifier described here. 

If a number of labeled samples belonging to the same class have feature 
vectors that are very close together in feature space then for the purpose 
of forming a cumulative potential function or discriminant function these many 
feature vectors may be replaced by a single ''potential center" located at 
the mean of the vectors being replaced and the new single potential function 
is given a weight equal to the number of labeled samples that it represents. 

In this way the storage requirements can be kept to manageable proportions, 

Por example, a resulting discriminant function that was formed from, say, 

100 potential centers could represent an extremely complex, non-linear decision 
surface. 

the classifier described in Section 6 checks each labeled training 
sample as it is presented to the classifier to see if it can be combined with 
an existing potential center. It does this by using a correlation clustering 
algorithm. In this way an efficient, but very powerful classifier is achieved. 

Another unique feature of the classifier described in Section 6 is the 
hierarchical manner in which the training data is stored in the computer. 

Each labeled sample can be assigned to a class at a number of different levels 
of specificity. For example, bad corn could be simultaneously classified as 
land, agricultural land, cultivated agricultural land, corn, and bad corn. 

All training data can then be stored as a classification tree in which more 
and more detail is achieved by going further down the tree. The classifier 
is able to classify an object at any level in the classification tree. 
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Classifiers based on the method of potentials have been recognized as 
being superior to statistical classifiers when the amount of training data 
is limited. This is often the case when processing multispectral scanner 
data since the cost of acquiring reliable ground truth can be very high. 
When this cost is taken into account then a more powerful classifier that 
can work well with a limited amount of ground truth may be more economical 
even if its processing time is longer. 

The main advantages of the classifier described In this report can 
be summarized as follows: 

1) It is a nonparametric classifier that works well with multimodal 
data and whose performance should continue to improve as the 
dimension of the feature vector is increased. 

2) It is trained iteratively in such a way that all training data 
are correctly classified by the classifier. 

3) It can equally well handle a large amount of training data 
(by using correlation clustering to reduce the number of 
potential centers) or a small amount of training data (by 
using each training sample as a potential center). 

4) It can classify at various levels of detail by storing the 
training samples in the form of a classification tree. 

5) It can be trained over a period of time, getting better and 
better as additional ground truth information becomes available. 

5.1 Discriminant Functions Formed by an Iterative Application of Potential 

Functions 

The method of potentials uses labeled training samples to form, non- 
linear discriminant functions that can be used to classify test data. Let 

be the feature vector associated with the k sample of class i. An 

-.k 
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interpolating, or potential function K(x,x, is defined to be a function 

(i) 


that is maxitnum when x = x. 


and decreases monotonicalJy as [x - 


increases. Specific potential functions that have been used include 




1 + a| {x - X 


(i) 


(5-3) 


and 


K.(x,x^^^^) = expC-aijx - 

An estimate p(xjw.) of the state conditional probability density 


(5-4) 


~ 2 . 


p(x|w.) can be obtained by erecting a potential function K(x,x ^^^) at each 

^ X •-'K. 

of the samples of class i, adding all of these functions and dividing by 


N. . That is, 
1 ^ 


p(xlto ) = ^ z K(x,x ^^^) 
~ ^ ”i k=l ^ 


(5-5) 


The division by in Eq.(5) accounts for the fact that ere may be 

different numbers of samples in different classes* If all classes have 

equal a priori probabilities then, from Eq, (5-1), would also be 

proportional to an estimate of the a posteriori probability P(t^^|x), One might 

then consider using a discriminant function G^(x) equal to p(x|w^) given by 

Eq. (5-5) and then classify objects according to the following decision rule: 

Assign an object characteriaed by the feature vector x to class i if 

G.(x) > G.(x) for all j ^ i- 
1 « 1 ~ 

On the other hand if the training data is obtained by randomly sampling 

all objects to be classified, then the number of training samples obtained 

for each class is, in some sense, related to the a priori probabilities of 

class membership. In particular if is assumed to be proportional to P(w^) 

then by comparing Eqs. (5-1) and (5-5) it is clear that an estimate p(w^]x) of 

the a posteriori probabilities P(w. lx) can be taken to be 

N • ^ 

'P(w. |x) = A K(x,x ^^^) C5-6) 

^ ~ k=l " “ 

where A is some proportionality constant. A useful discriminant function 
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for class i might therefore be taken to be 

Nf 

G.(x) = I (5-7) 

^ “ k=l - 

The decision rule is to assign an object to class i if G^(x)> 
all j i. 

The location of the decision boundaries generated by the discriminant 
functions of Eq. (5-7) will depend on the number of training samples of each 
class N^. As has been noted this shifting of the decision boundaries is 
meant in some sense to account for the a priori probabilities. However, there 
is no guarantee that the discriminant functions given by Eq. (5-7) will classify 
all of the labeled training samples correctly. This situation can be corrected 

by using an iterative error-correcting scheme that adds a weighting factor to 

( i) C i) is 

the potential function K(x,x^ ) each time that the labeled sample 

( i ^ 

misclassif ied by G^(x ). The discriminant functions G^(x) given by Eq. (5-7) 

are therefore modified by the following error-correcting algorithm. 

For each labeled sample x, the discriminant functions G.(x, are 

~x 1 

(fi.) (£) 

computed for all classes. If G£(Xj^, > G^(Xj^, ) for all i ^ , then 

Gjj^(x) is left unchanged and the next labeled sample is considered. On the 

(P ) (P ) 

other hand if, for any i, Gn (x , ) ^ G.(x , ) then G (x) is changed to 

tC X ^ K 

(9 ) 

Gj^(x) + XK(x,x^^,0 where X is a constant. This rule is applied Iteratively 
to all labeled samples until all of the labeled samples remain correctly 
classified. After convergence the resulting discriminant functions are 
thus given by 


N± 

G.(x) = T (1+XC.,) K(x,x5^^) 


(5-8) 


,(i) 


where C., is the number of times that the labeled sample x, caused a 
ik .vk 


change. 

The discriminant functions given by Eq. (5-8) are used to classify test 
data by assigning an object to class i if G.(x) > G.(x) for all j i, 

X — 1 ~ 
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5 . 2 Using the Potenti a l Function Classifier in the Program GROUPX 

As described in Section 3 the purpose of the program GROUPX is to 
classify each of the clusters that have been created by the program CLUSTX. 
The program has a procedure called COSTMATRIX that computes a cost matrix 
using ground truth data. Fig. 3 shows an example of the costmatrix that is 
produced by the procedure COSTMATRIX. In this figure the rows correspond 
to cluster numbers and the columns correspond to class numbers. The numbers 
within the costmatrix are equal to the number of pixels belonging to a parti- 
cular cluster that are known from ground truth to belong to a certain class. 
Since it is desirable that all pixels in a given cluster belong to the same 
class one would hope that only one column in e^*.ch row of the costmatrix is 
nonzero. In any event the cluster is assigned to that class corresponding to 
the column containing the largest nimber of pixels. This selection is 
indicated on the right-hand side of each row together with a percentage 
indicating what percentage of the total of each row this maximum number 
represents, A figure of 100% means that all ground truth pixels in that 
cluster belong to one class. This is obviously the desired state of affairs. 

Printed at the bottom of each column of the costmatrix is the number of 
ground truth pixels that belong to clusters that have been assigned to that 
class. The ratio of this number to the sum of all pixels in that column 
is also printed as a percentage and is a measure of the percent correct 


classification. 

Those rows in the costmatrix corresponding to clusters containing pixels 
for which no ground truth exists will contain all zeros. These clusters 
must be classified using the method of potentials. It is also possible to 
train the potential function classifier directly from the ground truth for 
individual pixels and to then classify all clusters using the method of. 
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Fig. 3 CGSTMATRIX Output 


potentials. In this case the costmatrix is used only for informational 
purposes as an indication of how well the clustering algorithm CLUXTX 
performed. 

The discriminant functions given by Eqs. (5-8) and (5-3) are used to 
classify all of the unlabeled samples. Thus» if y is the spectral signa- 
ture associated with an unclassified cluster then y is assigned to class 
i if (y) > Gj (y) for all j ^ i. 

After all of the clusters have been assigned to a class, the input 
cluster tape is read and the cluster number associated with each pixel on 
the cluster tape is translated into a corresponding class number on the 
output tape- 

The effectiveness of the algorithm GROUPX that uses a Potential 
Function Method (PFM) classifier can be demonstrated by comparing its 
performance to that of a Gaussian Maximum Likelihood (GML) classifier. 

Both types of classifiers have been used to classify ERTS data containing 
agricultural fields in Fayette County. 

An area containing 12,726 pixels was selected of which a total of 
297 pixels of ground truth was available. This ground truth consisted of 
six classes and was divided between training data and test data according 
to the following table. 


TABLE OF GROUND TRUTH 



Class 

No. of Training Pixels 

No. of Test Pixels 

1. 

Soybean 

116 

25 

2. 

Corn 

40 

10 

3. 

Wheat 

14 

3 

4. 

Woods 

52 

16 

5. 

Bare Soil 

13 

4 

6. 

Clover 

3 

1 



238 

59 


26 



When the GML classifier was applied to this data, the covariance matrix 
for class 6 was found to be non-positive definite (the matrix was singular) 
and tVins this class could not be Included in the classification. The 
remaining 58 test pixels were classified by the GML classifier and the 
results are summarized in the error matrix of Fig. 4. These results show 
that all of the test pixels are classified as either class 3 or class 4. 

This is undoubtedly due to the fact that an insufficient quantity of training 
data yields inaccurate estimates of the mean vectors and covariance matrices. 

The PFM classifier GROUPL was then applied to this same data. The pro- 
cedure COSTMATRIX classified the following number of clusters that were used 
as potential centers for training the potential method classifier: 



CLASS 

No. of TRAINING POTENTIAL CENTERS 

1. 

Soyb^jan 

17 

2. 

Corn 

5 

3. 

Wheat 

1 

4. 

Woods 

7 

5. 

Bare Soil 

1 

6. 

Clover 

0 


A value of Ot = 5.0 and X = 1.0 in Eqs. (5-8) and (5-3) resulted in 
training convergence after a single iteration. A total of 51 test pixels 
were then classified and the resulting error matrix is shown in Fig, 5. 

Both the GML and PFM classifiers were used in similar version of GROUPX. 
ALGOL listings of both programs and given in the Appendix. The original 
scanner data had been clustered into 137 clusters using CLUSTX. The overall 
performance of the costmatrix using the training data was about 90%, A 
higher percentage could have been achieved by generating more clusters. Thus, 
relative to this upper limit a more accurate measure of the GML classifier 
would be an overall accuracy of 17.24/90 = 19.2%, while the overall accuracy 
of the PFM classifier is 84.31/90 = 93.6%. The total processing time for the 
version of GROUPX containing the GML classifier and for the version containing 


27 



the PFM classifier was 1 min. 17 sec. and 1 min. 29 sec. respectively. All 
programs were run on a Burroughs B-5500 computer - 

Another version of GROTJPX was tested that used a PFM classifier that 
trained on individual pixels rather than cluster centers obtained from the 
COSTMATRIX. In order to keep the number of potential centers or training data 
to a minimum (an actual advantage in PFM classifiers I) the test data used 
previously was used for training and the original training data was classified. 

A total of 136 pixels were classified using 79 different pixels as training 
potential centers. The resulting error matrix is shown in Fig. 6. 

The same data was used to train the GML classifier and the result of 
classifying the same 136 test pixels used in Fig. 6 is sho^ra by the error 
matrix in Fig. 7. The covariance matrix for class 7 was singular and there- 
fore the three text pixels for that class could not be classified. 

The results given above indicate the superiority of the new PFM 
classifiers over the GML classifiers. The main reasons for this improved 
performance include: 

1) Even limited quantities of ground truth can be effectively used by 
the PFM classifiers while the same ground truth may yield covariance 
matrices that are either singular or so grossly in error as to be 
meaningless . 

2) The pre-clustering of the training data (by using the COSTMATRIX) 
results in a manageable number of potential centers that represent; a 
faithful sampling of all available training samples. 

3) I'Jhen the training data is not normally distributed or is even multi-modal 
the PFM classifiers have no problem in forming accurate decision boundaries. 
On the other hand, the GML classifier may produce totally Inaccurate 
results in these instances. 
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6. A Nonparametric , Hierarchical Classifier 

In many classification problems, the taxonomy in which objects are 
to be placed is intrinsically hierarchical. Figure 8 illustrates a possible 
taxonomy for use in classifying areas of a photograph of the earth's surface. 

At the lowest level of classification (level i) a pixel is classified as 
either land or water. The next level of classification further subdivides 
the first level. The taxonomy sho™ indicates no further classification of 
water (for example into warm or cold) but a pixel classified as land at level 1 
might further be classified at level 2 as urban, woods, bare soil, or agricultural. 
The algorithm described in this section is able to classify an object to any 
desired level in such a hierarchical classification system. 

We assume a set of labeled training data, each of which has been classi“ 
fied. In order to classify an unknown object at level 1 (water or land) we 
need two sublists of our training data, one of all training data labeled water, 
the other of all training data labeled land. Using the iterative method of 
potentials described in Section 5.1, the discriminant function for each of these 
sublists is evaluated at the point in feature space corresponding to the 
spectral signature of the unknown pixel. Then the unknown pixel is determined 
to he either water or land according to which sublist yields the largest dis- 
crimlnant function value , To further classify the unknown pixel at level 2, 
we would need four other sublists of labeled training data: urban, woods, 

bare soil and agricultural, A training datum classified as, say, corn (Fig. 8) 
would be included in four sublists. 

*In the algorithm described below, a further requirement is made; namely that 
this largest discriminant function value must exceed some minimal threshold. 
Otherwise, the object is unclassif iable at that level. 
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Clearly, to implement hierarchical classification we need rapid access 
to various sublists of the training data. Representing the training data as 
subsets of lists is facilitated through the use of a multi-linked data structure. 
Each of the labeled training data is stored in a node 


U U ^ 1 L — 9 Node for Storing 

■' ^ —J Training Data 

Datum Descriptor List Pointers 

as shoira in Fig. 9. The datum "value” is stored along with a set of links 
or pointers. These pointers are used to associate that datum with those 
sublists to which it belongs. For each node, a pointer is required for each 
level of classification. If a given datum is the only member of a sublist 
at, say, level 2, the corresponding pointer is set to null (zero); otherwise 

it is set to point to the most recent datum that is a member of that sublist. 

The classification tree of Fig. 8 has 4 levels; hence, the nodes for storing 
the training data will each have 4 pointers. 

To facilitate the addition and deletion of training data nodes, it is 
convenient to include one more pointer with each node. This pointer is used 

simply to link all of the data into a list. If any training da;.um is to be 

discarded, this pointer can be used to link the unused node storage box to 
the free storage stack. When a new datum is required, the needed node storage 
box is removed from the free storage stack. This memory management technique 
insures that all available memory for training data storage is accessable. 

The actual arrays used to implement the multi-linked list structures 

tlr 

are now described. The i node box is composed of the i row (or element) 
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of the arrays described in Table 1 


Name 


Dimension 


FEATURE 

CLASS 

CLINK 

CLISTLINK 


NROWS X NCHAN 
NROWS X NLEV 
NROWS X NLEV 
NROWS 


Table 1 Arrays used for multi-linked list storage 


The array FEATURE is used to store the training data feature vectors, each 
of which is a vector of length NCHAN. The array CLASS is used to store the 
classification data. Each datum is classified according to the given classi- 
fication tree, where the branches are labeled sequentially as shown in Fig, 8. 
NLEV is the depth of the classification tree and therefore represents the length 
of a classification vector. Each row of the array CLINK holds the pointers 
needed to link that node into each of the respective sublists. Finally, the 
array CLISTLINK is used to link all of these nodes together in one list. 

The unused locations are also linked into a list (stack) by means of 
CLISTLINK. The location of the top of this stack is held in the variable 
CAVAIL. 


The data structure described above is adequate for storing all of the 
sublists of the labeled training data. However, it does not inherently provide 
rapid access to each sublist. For example, if it is known that a given train- 
ing datura is a member of the "land** sublist, then its level 1 pointer points 
to the ’’next" element of the "land" sublist. However, no mechanism is pro- 
vided for locating the beginning of the "land" sublist. Yet another data 
structure is required to provide access to the beginning of any desired sublist. 
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The data structure used here to access the sublists is a tree having 
a form similar to that of Fig. 8. Each node of the tree contains the address 
of the beginning node of a training data sublist. By "climb ing'^ through this 
tree any desired sublist can be located. 


A typical tree node is shown in Fig. 10. Storage is 



provided for a pointer to the head of a sublist. Also provided are pointers 
which locate the "sons" of that tree node. If the tree node has no sons 
(i.e.j a terminal or "leaf") then all of the son pointers are null (zero). 

In the present Implementation* a fixed miraher of son pointers is used; 
this number must equal the maximal "fan out" of the classification tree. The 
subclasses of a node are denoted by integers: 1,2,3,..; the absence of the 

subclass (or a sublist) is denoted by a null (zero) value in the i*"^ son 
pointer position. 

An array TNODE, with dimensions NODES x NSONS, is used to store the 
ttl til 

access tree. The 1 row of this array stores the 1 node box. The unused 
storage locations are linked together in a free storage stack using the first 
column elements of the array. The location of the top of this stack is held 
in the variable TAVAIL. The location of the root of the tree is held in the 
variable TROOT. 

In summary, two data structures are used for storing and accessing the 
training data. A multi-linked list structure is used to represent the data as 
a set of mutually inclusive lists (or sublists) . A tree structure is used 
to provide access to the various sublists. Together they provide the data 
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storage and access needed to support the algorithm for hierarchical classifi- 
cation by the method of potentials. 

6.1 The Classifier CHIMP 


The algorithm described below is called CHIMP (for Classification 
Hierarchy using an Iterative Method of Potentials) . There are three major 
phases in this algorithm: training data input, training, and classification. 

These phases of the algorithm are discussed separately. Reference is made to 
the listing of the algorithm in Appendix D, 

6.1.1 Training Data Input 

The function of this phase of the algorithm is to enter the training 
data (from TDFILE) into the multi-linked list structure and to construct the 
corresponding sublist access tree. As each labeled training datum is received 
by the procedure INPUT, the access tree is extended, if necessary, by the pro- 
cedure SETTREE to accomodate that new datum. SETTREE leaves behind a TRAIL 
vector of tree node locations which contain pointers to the sublists in which 
the new datum will be included. (These sublists may, of course, be empty, 
in which case the new datum will be the first element in the lists.) 

An important task carried out by the procedure INPUT is that of 
clustering the training data. Stated simply, training data that are suffi- 
ciently "close" to each other in feature space will be combined into a single 
datum located in feature space at the center of gravity of the included data. 
The number of data associated with each "cluster" or "potential center" is 
stored in a corresponding element of the vector WEIGHT (of length NROWS) . 

A new datum will be clustered with an existing one if the new datum falls 
within the WINDOW of the existing one, which WINDOW is a hypercube centered 
on the existing datum with half width WINDOWSIZE. The procedure INWINDOW 
determines whether or not the new datum is to be clustered. 

pagb is 
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Clustering is only done with data that match at all levels of classi- 
fication. Therefore, after deploying SETTREE, the procedure INPUT searches 
the deepest sublist, to which the new datum would belong, to see if the new 
datum is in the WINDOW of any existing datum. If so, then they are clustered 
and INPUT terminates. If not, then the new datum is placed in a new storage 
node which is then linked into each of the sublists identified in TRAIL. 

6-1.2 Training 

The potential function used in this algorithm has Che form 


1 + XCOUNT. 

f .(x,x.) = WEIGHT(x.) — — — ^ 

J ~ M m2 

1 + a||x-x il 


(6-1) 


where x. is a feature vector of a training data cluster, 

X is the feature vector of the unknown datum, 

X and Ot are scalar parameters, 

WEIGHT(x.) is the number of training data associated with x,, 

II * I j is the Euclidean norm, 

and COUNT^, is a counter used In training as described below. 

The discriminant function for a subclass (sublist of potential centers) 
is 

D (x) = 2 f(x,x.) (6-2) 

k ~ j - ~3 

where the summation is over all elements in the sublist corresponding to the 
subclass k. 
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An example will Illustrate the use of these discriminant functions. 
Suppose we wish to classify an unknown object with feature vector x at 
level 1 as either water or land (Fig. 8). Two discriminant functions are 
required: and is defined over the subclass of alt training data 

labeled ’’water,” and D (x) is defined over the sublist labeled "land." 

D^(x) and Dj^(x) are evaluated and x is classified according to which dis- 
criminant function value is greater. If the unknown x is classified at 
level 1 as land then it can further be classified at level 2 as ’’urban,” 
’’woods,” ”bare-soil” or ’’agricultural.” To do this classification at level 2, 
four discriminant function values are needed, corresponding to the four 
training data subiists. Again, the unknown x is classified according to 
which discriminant function value is greatest. 

Implicit in this algorithm is the requirement that the potential centers, 
resulting from the training data, be themselves classified correctly by the 
discriminant functions. If the value of COUNT^ in Eq. (6-1) is zero it 
usually happens that some of the potential centers would not be correctly 
classified by the method. To overcome this shortcoming, COUNT is introduced 
into the potential function and adjusted until each potential center is 
correctly classified by the discriminant functions. 

The adjustment of the COUNT^ 's is an iterative procedure as described 
in Section 5. During each pass, the classification of each member of each 
sublist is checked by the procedure CLASSIFIEDCORRECTLY. If the classification 
is wrong, the value of COUNT^ for that sublist element is incremented. Note 
that since each potential center can belong to as many as NLEV sublists, 
then each potential center may have NLEV values of COUNT associated with it, 
one for each sublist to which it belongs. If, during the first pass, any 
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sublist element were incorrectly classified, then the whole procedure of 
checking the sublist elements is repeated a second time to determine whether 
or not the new COUNT values were sufficient to yield accurate classification. 
This procedure is repeated until all sublist elements are correctly classi- 
fied (or until a limit on the number of these iterations is reached) . This 
completes the training phase of the algorithm. 

The procedure TRAIN is the driving procedure for this phase. It 
executes training passes by deploying TREECHECKER until training is success- 
ful, or for a maximum of 20 times. Also, it reports on the results of each 
pass. 

The procedure TREECHECKER traverses the sublist access tree and deploys 
CHECKSUBLIST for each sublist accessed by the tree. TREECHECKER returns a 
Boolean value indicating whether or not all of the elements in all of the 
sublists were classified correctly. 

The procedure CHECKSUBLIST traverses a sublist and deploys CLASSIFIED- 
CORRECTLY to determine whether or not each list element is correctly classi- 
fied. If a list element is not classified correctly then the corresponding 
COUNT is incremented. CHECKSUBLIST returns a Boolean value indicating 
whether or not all elements in the sublist are classified correctly. 

6,1.3 Classification 

The previous example on the use of the discriminant functions given in 
Section 6.1.2 describes the general technique for classification of an 
unknown. The procedure CLASSIFY carries out the classification of an unknown 
feature. The classification is carried out only to a depth specified by the 
parameter MAXLfeVEL (but, of course, not to exceed NLEV - the maximal depth of 
the tree). The unknoxm, to be classified by CLASSIFY, is held in the vector 
NEWFEATURE (which is taken from a row of the input array SIG) . The vector 
of classification results produced by CLASSIFY is held in NEWCLASS. 
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The operation of CLASSIFY can be understood by following it through 
one level of classification. A pointer P is set to the root of the access 
tree. The local variables BIGCLASS and BIGVALUE are set to zero at the 
beginning of each new level of classification. Each of the sublist discri- 
minant functions is evaluated and compared In turn with the current value of 
BIGVALUE, If the discriminant function value is greater than BIGVALUE, then 
BIGVALUE is replaced and BIGCLASS is replaced by the sublist index. The 
sublists are accessed through the sublist pointers and those tree nodes that 
are the sons of the root node. TNODE [TROOT, 1+K] points to the Kth son of 
TROOT. Therefore, TNODE [TNODE [TROOT, 1+K] ,1 ] points to the sublist corresponding 
to the Kth son of TROOT. 

When all of the sublist discriminant functions for the sons of TROOT 
have been compared to BIGVALUE, then BIGVALUE will contain the value of the 
largest discriminant function and BIGCLASS will contain the corresponding 
classification. However, before the assignment of BIGCLASS to NEWCLASS is 
made, it is required that BIGVALUE exceed THRESHOLD. The a priori assumption 
here is that if the greatest discriminant function value falls below THRESHOLD 
then no proper classification can really be made. In this event, ~1 is 
placed at the appropriate level in NEWCLASS and the procedure terminates. 

However, if BIGVALUE exceeds THRESHOLD then BIGCLASS is placed in the 
proper (first) element of NEWCLASS. Then the local pointer P is moved down 
the tree one level to the Kth son of TROOT, where K = BIGCLASS. This process 
is then repeated until MAXLEVEL is reached or until a terminal is reached. 

In summary, the major advantages of the classifier CHIMP include; 

1) Labeled samples can be represented by a hierarchical tree structure 
and unknown objects can he classified at any level of the tree. 

Labeled samples that are known at only a certain level can be 
used to train the classifier up to that level. 
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2) The classifier can produce good results with a limited amount of 
training data. This is in sharp contrast to parametric classifiers 
such as a Gaussian maximum likelihood classifier for which consider- 
able training data is required, particularly for higher dimensional 
feature vectors. 

3) The classifier can operate well when large quantities of training 
data are used. Previous attempts to use potential function methods 
with large amounts of training data have been plagued with computa- 
tional difficulties. CHIMP incorporates an automatic clustering 
algorithm that reduces the training samples to a manageable number 
of potential centers. These potential centers represent a faithful 
sampling of all available training samples. 

4) The classifier CHIMP can produce very general, nonlinear decision 
boundaries. These decision surfaces can be used to accurately 
classify multi-modal as well as unimodal data. 
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7. 


An Interactive Color Display for Multispectral Imagery Using Correlation 
Clustering 


Two distinct but complementary approaches to the processing of multi- 
spectral scanner data have been followed. One approach focuses on digital 
processing and has as its goal the classification of each ground resolution 
element, or pixel, in a given area. Sections 3-6 of this report describe an 
example of this approach that uses correlation clustering and nonparametric 
classification techniques to classify each pixel. The second general approach 
uses a variety of techniques to produce color maps of the ground area that are 
suitable for visual inspection and interpretation by humans. One common 
method is to use the intensity of one color (red, green, or blue) to represent 
the intensity of the reflected energy in one of three channels. If these 
three color images are superimposed (either photographically or with a color 
video system) then a full color map is obtained. 

There are a number of limitations to the color maps ptoduced in this way. 
First of all, since one color is associated with one particular spectral 
channel of the data it is difficult to produce a map that uses data from more 
than three different spectral channels. On the other hand, multispectral 
scanners with up to 24 spectral channels have been bui’t. Even if one uses 
data from multiple-passes of the 4-channel ERTS multispectral scanner, then 8, 
12, or 16 effective channels of data (combinations of spectral and temporal) 
would not be uncommon. 

In an effort to include information from more than three channels a 
member of digital processing techniques, including various clustering methods, 
hc-.ve been developed. The results of such digital processing can be used to 
oroduce color maps with display systems such as NASA's PMIS-DAS system at the 
Johnson Space Center in Houston. 
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Is it possible to process raultispectral scanner data in an unsupervised 

manner and produce color classification maps interactively in real time? In 

this section we will describe the design of such an interactive color display 

system that uses correlation clustering techniques to produce color maps of 

, . 18 

multi spectral imagery in real txme. 

Fig. 11 illustrates hoxij the system would be used. An operator sits at 
the color display screen and has access to a number of control knobs located 
on the console. The color display contains an image of a certain ground area 
made up of, say, 500 x 500 pixels. The operator can adjust the knobs such that 
the entire screen is a single color. Additional adjustment will produce a 
broad level classification map in which perhaps all water appears blue, agri- 
cultural land appears green and forests appear red. Further adjustments might 
result in only the agricultural fields appearing in color with different colors 
representing different types of crops. In other words, the operator can "tune 
in" to as much detail as he wishes using his own judgment to interact with the 
image causing it to change in real time. 

It is important to understand that the processing that is going on is 
entirely unsupervised in the sense that no a priori ground truth information 
is used. On the other hand the operator "supervises" the processing in an 
Interactive mode and may very well use a priori information that he has about 
the general nature of the area in order to produce a useful map. 

Obtaining good ground truth information may well be the most expensive 
part of supervised digital pattern recognition systems. The color maps produced 
by the system described in this section could prove to be very useful in identi- 
fying meaningful ground truth areas. This is true because a particular color on 
the map represents a localized region in the N-dimensional feature space asso- 
ciated with the N-channel multispectral data. 
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Fig. 11. Operator Processing Multispectral Scanner Data 
on Real-Time Interactive Color Display System 
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For many applications such as the production of land-use maps, the 
maps produced by this system may be the only type of processing of the 
scanner data that is required. In any event it seems clear that such a 
device would greatly increase the productive output of a group involved in 
the processing of raultispectral scanner data. 

In Section 7.1 the general method by which correlation clustering 
techniques can be used to produce color maps will be described. Various 
technologies, Including digital, optical, and analog, that might be capable 
of producing the color maps in an interactive and real time environment will 
be surveyed and evaluated in Section 7.2. A hybrid system in which the 
correlation clustering is accomplished with analog circuitry is described in 
Section 7.3. Finally, Section 7.4 presents conclusions and recommendations 
for future development. 

7 . 1 Correlation Clustering Images from Multispectral Scanner Data 

What does an N-channel multispectral image look like to a human observer? 
Or, alternatively, how can the information contained in N-channels of multi- 
spectral scanner data be presented in a form that is readily understood by 
a human observer? Inasmuch as the eye is able to distinguish a wide variety 
of color shades and hues it would seem advantageous to use a color display to 
present the multidimensional information contained in the scanner data. In 
particular, the goal will be to associate a given shade of color with a parti- 
cular localized region in the N-dimensional feature space. The size of a 
particular localized region and the color associated with it should be under 
the interactive control of the operator. 

The color c of a given pixel will be some combination of the three 
primary colors red, r , green, g , and blue, b . That is, 

c - C^r -t- Cgg + Cgb (7-1) 
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where C_, and C are the proportions of red, green, and blue 

R G B 

respectively. (For a color video tube C , C , and C could be the voltages 

KG 15 

applied to the red, green, and blue guns respectively.) The values of C^, C^, 

and C_ are determined by the following correlation clustering method. 

Let X be the N-channel spectral signature associated with a particular 
T C i) 

pixel. That is, x = [x. , x , ..., x ]. Let y be a reference spectral 
^ ^ n ~ 

signature associated with the color i (i = R, G, or B) . Let (J>^ (x^ - 

be a weighting function associated with channel j whose value is a maximum 

at X. = and whose value becomes small as lx. - y.^^M increases. (For 

1 J 1 J ' 

a possible example of the functions (J>, (x, - y for the case of 4-channel 

J J j 

data, see Figure 2 in Section 4.) 


The correlation function C. associated with the color i (i = R, G, 

1 

or B) is defined as 




N 

E (j) 
j=l ^ 


(X. - 


(i). 


(7-2) 


From the properties of the function it is clear that the maximum value of 
C . is equal to 


qMAX 

1 


N 

.2 <^.( 0 ) 

1=1 7 


(7-3) 


and will occur when the spectral signature x is equal to the reference 

(i) 

spectral signature y . It is also clear that a large value of will 

(i) 

occur when the spectral signatures x and y are similar, while a small 

value of C will occur when x and y^^^ are dissimilar. Thus, if the three 
i ~ 

(i) 

reference signatures y^ ^ are well separated then, for example, a pixel with 

(R) 

a spectral signature x = y would appear red. Similarly, pixels with 
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spectral signatures x = and x = ^ would appear green and blue 

respectively. Other pixels with arbitrary spectral signatures x would have 
colors given by (7-1) and (7-2) . 

R G 

An example of the locations of the three reference signatures y , y , and 

g 

y for the case of 2-channel data is shown in Figure 12. In this figure a 

"region of influence" is shown as a solid curve surrounding each color center. 

The size of each region is representative of the "widths of the corresponding 

weighting functions (j) . . In a real time interactive system the operator 

(i) 

would be able to vary both the color centers y^ and the size of each "region 
of influence" surrounding each color center. In this way the operator can 
watch as the display changes in real time as the result of varying the different 
parameters. Large regions of influence corresponding to wide functions will 
result in color displays in which large areas with different spectral signatures 
will appear as (nearly) the same color. On the other hand, narrow functions 
can be used to isolate in a single color only those pixels with a particular 
spectral signature. By this interactive mode of operation it should be possible 
to extract the maximum amount of information from the multispectral data in a 
minimum amount of time. 

The next section will consider a number of technologies that might be 
used to make the type of interactive color display system that has been described 
above. 

7.2 T teractive Displays Using Digital, Optical, or Analog Systems 

When thinking of an interactive color display that is to operate in real 
time one thinks first of a TV type of video display system. Assuming a 500 x 500 
pixel display that must be refreshed every 1/30 sec., one sees that a 7.5 MHz 
data rate is required to refresh the video display. Such systems are available 
and in use today. However, this will simply display a single image and does not 
process the multispectral data in any way. 
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Fig. 12 Localized Color Regions in a Two-Dimensional 
Feature Space 
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What is desired is to be able to change the correlation functions 
given by Eq. (7-2) in "real time" as observed by the operator. Suppose one 
tries to do this digitally. Assume that the calculation of a single value 
of requires only 5 basic operations each taking 1 psec. For ERTS data 

this calculation must be done for each of the four channels and the results 
added ^assume 1 ysec per add) to obtain in Eq. (7-2). Thus, it would 
take 23 psec to compute for each of the three colors. Therefore, each of 
the 250,000 pixels would require 69 ysec of computation which means that it 
would take over 17 sec. to change the video picture. This is obviously not 
Che real time operation that is desired. 

The basic problem with digital computations is that there are too many 
pixels (250,000) and one can l.nerefore afford to spend only about 1 psec per 
pixel if the entire calcT'iation is to be completed in some fraction of a second. 
This suggests that a substantial amount of parallel processing must be done 
if real time operation is to be achieved . Although digital computers with 
substantial parallel processing capabilities have been designed and built (such 
as the ILLIAC IV), there are major problems with their use and they would not 
be suitable for use in the small type of dedicated system envisioned here. 

Optical processing in one sense offers the ultimate in parallel processing. 

2 

The author has previously suggested a method by which holographic correlation 
techniques could be used to produce classification maps of a type similar to 
those described in Section 7.1. In such a system all of the pixels are pro- 
cessed simultaneously at the speed of light. However, a real time system would 
require a real time input transducer capable of changing coded data for all 
pixels at video rates as well as a real time medium for recording the holo- 
graphic filters. While a number of such real time devices and recording media 
are being developed in various laboratories, none at the present time possesses 
all of the properties that would be required for the type of interactive system 
being discussed here. 


51 



Additionally, In order to make a color display it would be necessary 
to construct an elaborate system containing lasers of three different colors. 

It is clear that such an interactive real time system using coherent optical 
processing is not within the current state of the art. 

Returning then to the color TV video display, is there any way that the 
processing described by Eq. (7-2) can be done in real time? The next section 
will describe a hybrid system in which this interactive processing is accom- 
plished with electronic analog circuits- 

7 . 3 Design of an Interactive Correlation Clustering Color Display System 

In this section an interactive system that will process ERTS multi- 
spectral scanner data in real time will be described. An overall block diagram 
^ /stem is shown in Fig. 13. The scanner data for an area of up to 

* pixels is transferred from magnetic tape to a high speed magnetic 
disk using a minicomputer which serves as a high speed buffer. Up to 250,000 
bits can be stored in a single track on the disk. Thus, eight parallel tracks 
can store the 8-bit per pixel data for an entire TV frame for one of the 
spectral channels. Thirty-two tracks can then store the data for all four 
spectral channels. The disk rotates at 1800 rpm so that data for a complete 
TV frame is read every 1/30 sec. 

The 32 bits representing the spectral signature for a given pixel are 
read from the disk in parallel with 32 fixed head transducers. This data is 
fed through four 8-bit digital-to-analog converters. Thus, four voltages 
(Vi, V^, V^, V^) representing the spectral signature of a single pixel are 
available simultanously , These four voltages are fed into an interactive 
analog processor containing analog circuits that process the data. This 
processor contains the interactive controls that determines the nature of the 
processing. The output of tnis analog processor consists of three voltages 
that go to the three color guns of the video display. 
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Fig. 13 j attractive Color Display System for Multispectral Imagery 
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The analog processor contains three similar circuits as Illustrated 
schematically in Fig. 14. Each of these three circuits is associated with 
one of the three colors (red, R, green, G, and blue, B) . Each of these color 
circuits contains two control knobs per spectral channel. Thus, there are 
eight variable controls for each of the three color circuits, or a total of 
24 control knobs for the entire analog processor. 

Each of the three color circuits making up the interactive analog pro- 
cessor is of the form shown in Fig. 15. The variable voltages V , V , V , and V, 

a D c a 

represent a reference spectral signature that is to be correlated with the 
spectral signature of a given pixel which is coming from the digital -to -analog 
converters. The gains a^, and of the four differential amplifiers 

are also under the interactive control of the operator. The values of the 
voltage at different points in the circuit are indicated in Fig. 15. An 
example of the four voltages entering the output summing amplifier in Fig. 15 
as a function of V^, V ^ V^, and for particular settings of 

and is shown in Fig. 16. It is .clear that the output of the summing amplifier 

is the correlation given by Eq, (7-2). Three such outputs from the three 
color circuits in Fig. 14 are then combined in a color TV tube to produce a 
particular color as indicated by Eq. (7-1). 

The entire 500 x 500 pixel TV frame is refreshed every 1/30 sec and thus 
the whole picture is changed in real time as the controls of the interactive 
analog processor are varied by the operator. These controls allow the operator 
to move the locations of the three color centers in feature space and to vary 
the size of the "region of interaction" for each color (See Fig, 12). The 
system described above for 4-channel data can be extended in a straightforward 
way to accommodate large numbers of spectral channels. Fixed head magnetic 
disks exist that could handle up to 24 channels of multispectral scanner data. 
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Fig. 16 Example of Signals Entering Output Summing Amplifier in Fig. 6 
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7,4 Conclusions 


This section has described a new method of processing multispectral 
scanner data in a real time interactive environment. The result of the 
processing is a color video display of up to 500 by 500 pixels in which a 
given color represents a particular localized region of feature space. 

Tlie size and location of these localized regions of feature space are 
under the interactive control of the operator. Thus, the user can elect 
to look at as broad or as narrow a region of feature space as he wishes. 

The interactive system for processing 4-channel data contains 24 control 
knobs that the operator can vary. In general the number of knobs will be 
6xN where N is the number of spectral channels. The ultimate goal would 
be to have the computer control the knobs (with perhaps some fine tuning 
by the operator). For example, ground truth information could be used to 
locate "interesting" regions of feature space that could then be painted 
with various colors. A whole new approach to the digital processing of multi- 
spectral scanner data will be concerned with how best to have the computer 
"turn the knobs" in order to produce meaningful motion picture classification 
maps of various levels of detail. 

The real time interactive system should be built in order to test the 
human reaction features of the system. It is expected that this system 
will effectively put the human brain into the data processing and pattern 
recognition loop. Since the operator views 250,000 pixels at a glance, he 
xidll be able to use the spatial information that is apparent to him to guide 
his way through the spectral feature space. After studying how the human 
operator reacts to this system an effort should be made to train the computer 
to "turn the knobs" and thus produce its own motion picture classification 
maps based on ground truth or other adaptive learning information. 
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APPENDIX A 


ALGOL Listing of CLUSTH 
Including Procedures 


HEAD2 

INPITTH 

CALCULATE 

DATA3 

ASSIGNH 

CTAPEHEAD 
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CLUSTH 


tREViSED VERSION! 


riLE IN 
n E iN^ 
Fl.t OUT 
FILE OUT 

1 ! 


^ILE 


BEGIN 

ERTS ? C2^500); 

CARD DISK{2# 10^ 30!i 
LINE PRINT (2»17); 

CTAPE 2C?»900»SAVE 99)T 
IN DI5KFILE CTAPETl 

there are 150 WRDS/RECD AND 300 WROS/RLK 
maximum logical records = 4 TIMES 200 = 800 
CTAPET DISK 14(200] C2»150#300)t 


glossary oe global variables 


real 

% 


CMi n; 


integer array cnroi50]j 


integer 


Integer 


% 

integer 

integer 

integer 

2 

integer 


REAL ARRAY 

% 

2 

K 


Integer 

% 

integer 
real array 


tncsi 


maxclust; 


NBACKI 


NCHANI 

NCLUSTi 

NROLDl 

ns; 

NSAMPI 

NWR04H; 


the correlation threshold, 

IE A CORRELATION C IS COMPUTED EOR^A^ 
GIVEN CLUSTER^ AND lE C GEQ CMIN, THEN 
THE PIXEL IS ASSIGNED TO THAT CLUSTER, 

AN array containing 51 WORDS EOR STORING 


HEADER INEORMATION AS DEFINED IN FIG. 5, 

FNORFC; % THE FINAL SCAN LINE OR RECORD NUMBER 
TO RE READ FRDM THE INPUT TAPE. 

FSfDSAMPJ % THE FINAL SAMPLF NUMBER DR PIXEL 

NUMBER TD BE PROCESSED IN EACH SCAN LINE. 

tncr; % THE Increment used in reading scJan lines 

FROM THE INPUT TAPE, IF INCR=l» ALL 

SCAN LINES ARE READ, IF INCR=2. EVERY 

OTHER SCAN LINE IS READ. ETC., ^ 

TNCS; % THF INCREMENT USED IN PROCESSING SAMPLE 

numbers in each scan line. ip INCSal#^^^ 
EVERY PIXEL IN THE SCAN LINE IS PROCESSED, 
IF INCS=2* EVERY OTHER PIXEL IS 

maxclust; % f Hp^^Axf M^M^ nS^BER nF CLUSTERS ALLOWED. 

IF THE PROGRAM TRIES TO CREATE MORE 
than MAXCLUST CLUSTERS THE PIXEL IS 
ASSIGNED TO AN ''OTHER” CATEGORY BY 
setting NSsO, „ , 

MR( % the NUMBER OF RECORDS TO BE SKIPPED IN 

ORDER TO READ SCAN LINE NUMBER NRECO. 

NBACK; % THE NUMBER OF CLUSTERS FOR WHICH A 

correlation is COMPUTED BEFORE A NEW 
CLUSTER IS CREATED, IF NBACK EXCEEDS 
THE CURRENT NUMBER OF CLUSTERS. THEN 
ALL CLUSTERS ARE CHFCKEO. 

nchan; % number of spectral channels on tape, 

NCLUSTi % THE NUMBER OF CLUSTERS THAT HAVE 

BEEN CREATED, ^ 

nrold; 2 THE Last scan line to have peen read, 

ns; % CLUSTER NUMBER. , , , 

NSAMPI % NUMBER OF SAMPLES fPlXELS! IN EACH 

SCAN LINE ON THE TAPE, 

NWRDAH; % THF NUMBER OF AB-BlT WORDS NEEDED TO 

STORE THE DATA IN ONE SCAN LINE, 

THIS VALUE IS COMPUTED IN HEA02 AND 
USED TO ESTABLISH THE DIMENSION OF THE 
array IDAT in CALCULATE. 

SlGtO( 12.0B200]; siQtJ.ivs] IS A two-dimensIOnal array 
containing THE AVERAGE SPECTRAL SIGNATURES 
ASSOCIATED WITH EACH CLUSTER. 

spectra? CHANNELS^AWD^fS^^COLUMN^ OE SIG 

STARTREC, * "ftUMBER 

TO RE READ FROM THE INPUT DATA TAPE, 

startsamp; % THE Initial sample number or pixel^, 

number to be processed in EACH SCAN LINE. 

WTnTHrO*243;% the parameters that CONTROL THE WEIGHTING 
functions in fach channel as defined 

IN FIG. 3. 


STARTREC; 

startsamp; 


LABEL 


ending; 




PROCEDURES HEA02. INPUTH. AnD CALCULATE ARE INSERTED HERE 




{ * it* i 


EWOIwG * 


it* * PRDG*?fl^ *** *** *** 

THF PPnCiRAh CLLJSTH PFADS A QATA TAPE rONTAlMlNG 

MIII.T ISprCTR.Al. scanner DATA AND PPHnljCFS A CLUSTER TARE 

CALLED ctapf On which each pixel has pefn assigned to' 
ONE Of maxclust clusters, each CLUSTFR contains pixels 

WITH SIHII AR SPFCTRAL SIGNATURES, 


HFAn?rERT,S,NCHAF#NSAMP)J 
T ndhth r 
CALCULATfI 
END nr PRnr.PAM, 


OBiGINAIi PAGE IS 
OF POOR <3 [UAIiM 
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HrAn?CFlLFNAMr^''C>^AN*MSAMp)r 


PROCrDliRT 


% 

ts 


THE PPnCFnuPE HFAn^ READS THE HEADER OE A DATA TAPE THAT 

has RFFM WRITTEM TN MRSYS-II eDRk'AT, FRDm This header 
IT PETERMInES the HHMnKR HE SPEETRAl CHAMHELS NCHAR, AND 
TMF "'UVRF.R HE PIVELS? NSAwpj U' A SIMPLE SEAN LTME. 

IT Ai sn rnnHuTEs mwroab. 


FILF 

w 

JNTErER 

INTErER 


E T L E A M E 1 

S' r H A ; 

N S A H P J 

BERT M' 


% THE FILE IOEnTIETER EDP THE DATA 
TAPE WHOSE HE A nr V? Is BE I ME, READ. 

% MUMHEP OF spectral PHANMELS OM TAPE. 
7 NIIHPER of samples (PIXELS') IM EACH 


iwterer array 

% 

TJ^tTEpER array 
Z 
% 

TNTEOER 


TprO:?on]j am ''RPAY tm which the dmpacked 3?-bit 

WPRDS OE the lAPSYS-U HEADER APE STDRED. 
iHEDtO : 1 3'M;? am array IMTD WHICH THE DATA FROM 
THE header Is read as iiNppncEssrn 
E'ILL AS-BTT WHRDs, 

wj 7 if'nrxiMr, variable 


format ruTCf x?o,*’hfadep record nr larsys-ii tapE”/>> 

EMTA ( XI 0. '*TAPF I 6# X 1 0# "DAT a IS C QMT I N'i A T I ON " 

" OF FLIDHT LTME STARTED DM TAPE*'. 1ft), 

EWT5CX1 n,**TAPE ^'lJMbER«, 16 ) , 

EWTft f X ] Op '*E n E MlJHBERs:'»p 1 6p Xl n. "RUN M DM BE R s'*/ I A / X 1 0 » 
"MUMRER he CHANNE LSs"p I 3 . X6p "NUMBER JE SAMPLES"* 

" PEP LlMF-"pTa)» 

EMT7(X10p"nATE"pX3p3l6/XIO*"A|TITt(DEs"*TlnpX9* 

" BROUN n HEAD I MBs'S 16 ) ; 


r 

MAIN UnnY of head? 


PEAn f filename* 1 3A, IHEOr* ) Jj 


FOR Ki=n STEP 1 UMTIL 20 

on 



REPLACE PPU'TEPC lOr* lp8) + B + ftxK 

BY pf'IMTERCTHEOriil,B)+Kx/; EflR 41 


M r M A ,\! I = I D c S I F 

MSAMP ! =TD t 6 1 ; 

MHPD4fi Sa(Nr:HANxMSAMP + a) 

n IV 6 + If 

’‘'RITECI. INEjEHTD ) J 

IE iniAl EQl 0 



them 


EL SE 

WPTTrfLTNE*EMT5*in[n*l0t4n 

I W R I T E f 1. I N E * E M T A * T D I" 1 1 • T n C A n ; 


WRITE ( LT JE * EMTft * T ur ?1 * I (> C 3 1 * T nf 5T * I DI ft 1 ) ; 

WRITE (LTNE*EMT7*TD[in*TnM2)*IDrl31*irCl5I*IDei6]); 
WRITE CLINFIPAGFI)! 

End 

pE HEAD?! 








PRnCFOURE tmphth; 

X the PRncEpURE TNPHTH READS VARIHUS IMPUT PARAMETERS 

t FROM SIX DATA CARDS- ALL VARIARLES APE GLOBALa 


integer 

INTEGER 


BERT M 
I ; 
Jt 


X IMDEXING VARIABLE- 
% IMDEXING VARIABLF, 


FORMAT IN 
PORMftT IN 
FORMAT OUT 


format Out 

format out 
format out 
format out 

format out 


FTM] ( 10153; 

FTN? ( 10 F 6 ,? 3 ? 

FOiiTl ("FOR THIS RUN’V/"NBACK r ”. 15 / 

•’MAXCLUST = NCHAN = ”.15/ 

"STARTREC = ENPREC = TNCR = 'N 


IS/”STARTSAMP = ”, 
'• INCS 5 ”,15//3; 


15.” FNDSAMP 5 ”.I5. 


” INCS s ”.15//3; 

Fn|iT2 (”THF ALLHWFD DELTA FOR FACM cHANMEL TS ” 
”A5 FOLLOWS” /”CHA N" » X5. ”nFI T A”/ ) ; 

FmiT3 ( I3.X5.F6-P3; 
rmiTAC”CMiN = ”.FA. 1 ); 

FnuTSC”riATA >ITLI RE WRITTEN ON FIlE NUMBER”. 

I5.”nM THE CLUSTER TAPE CTAPE”>; 

SAmPERR (” *** ERROR « * * ”. X f 0 , ”E N OS A MP IS ” 

"TOO LARGE .”/”£NDSAMP WUl BE READJUSTEO Tn ” 
”F<3UAL nsamp'm; 

T***************.-***** 


I 


MAP' Ponv OF INPIJTH 


NrLUST;^o; 
NROLD » = 0 J 


RFAD( CAR0»<PA6>. CTl rOl.CTU 133 1 
READ (CARO.FTNl .NRACK.MAXC! UST 3 ; 

READ (CARD, FTW2. FOR Jl:=l STEP 1 UNTIL NCHAN 
DO WTOTH[ J1 33 J 
REAHf CARD. FIN?. CMTN3I 

READ (CARD.FIMI.STARTREC.ENDREC. I NCR)? 

READ fCAHD.FiNl .START SAMp.ENDS AMP, INCS); 


WRITE ( LTNE.r OUT I.NRACK.MAXCLUST. NCHAN. 

STAPTRFC -EMOREC. INCR.STARTSAMP.EnDSAMP.TNCS); 
WR 1 T E C L T NE . rnUT/| , C M 1 N 3 ; 


If ENllSAMP RTP NSAMP 


THFN 


RESTN 

WPITF(L1NE.SAHPFRR3; 

FwnsAMp:=NSAMp; 

FNH 


El se; 


write (l iNE.FOuTn; 

WRITE (LINE.FnuT3.FfJR Jls = l STEP 1 UNTIL NCHAN 

no r Ji .wiDiHr Ji 3 n; 


ENH nr inputh; 
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PROCEDURE 

X 

% 

% 

% 


X 

X 

% 

% 

integer 

% 

INTEGER 

% 

% 


CALCULATFI 


THE PROCEDURE CALCULATE READS THE INPUT DATA TAPE A SCAN 
LINE AT A TIME USING THE PROCEDURE OATA 3 , IT THEN ASSIGNS 
EACH PIXEL IN A GIVEN SCAN LINE TO A PARTICULAR CLUSTER 
BY USING THE PROCEDURE ASSIGNH, 

BEGIN 

**'******^****** ***************** ****** 

glossary of variables local TD CALCULATE ^ 

AND GLOBAL 10 0ATA3# ASSIGNH# AND CTAPEHEAD. 
************************************** 


ARRAY lOATCOlNWRDAB]; % AN_ARRAY INTP wyiCH,TH 
ARRAY IDUM[0:NCHAN# 


% 

integer 

% 

|nI|ger 

integer 

Integer 

% 

% 

% 

integer 

X 

integer array 
integer 
integer 
^ntegeR array 


REAL ARRAY 
FORMAT OUT 

format out 


^ data from 

A SINGLE scan LINE IS READ AS 

- — - - A _ Fh ^ ^ ».l ^ 

*!nto 


IRECNO) 

I^^Smpj 

KNT j 


NPIXELSI X 
NSSrOlNSAMPl) 
NRECDJ X 

NSCANLINE) X 
NUMIOtMAXCLUS 


UNPROCESSED FULL 48*BIT WORDS. 
OiNSAMPllX A TWO DIMENSIONAL ARRAY 
WHICH THE UNPACKED 8"BIT BYTES. 

representing the spfctral signature 

nr EACH PIXEL IN A SCAN, LINE ARE STORED. 
THE ROWS CORRESPOND TO THE CHANNEL 
NUMBERS AND THE COLUMNS CORRESPOND 
TO THE SAMPLE OR PIXEL NUMBERS. ^ 

AN INdB !(iR{i§tl*CnRRESPONDlNG TO A 
SAMPLE OR PIXEL NUMBER,.. 

AN INDEX VARIABLE CORRESPONDING TO, TH| 
PIXEL NUMBER ASSOCIATED WITH THE CLUSTER 
NUMBERS THAT ARE WRITTEN ON THE 
OUTPUT CLUSTER TAPE CTAPE, 

X an array FOR STORING THE CLUSTER ^ 
numbers of each pixel in a SCAN LINE, 

AN INDEX VARIABLE CORRESPONDING TO A 
SCAN LINE OR RECORD NUMBER,. 

THE number of scan LINES THAT 


HAVE BEEN PROCESSED. . ,,,, 


HBER 

D 


TO EACH CLUSTER, 


SiGCOiNCHANfOlMAXCLUST] } 


RECDERR (”** ERROR 
"IRECNO»*»# 155 J 
neatly (2014)) 


**«#X5*"NREC0a«#I5#X5# 


PROCEDURES 0ATA3# ASSIGNH# AND CTAPEHEAD ARE INSERTED HERE 
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*** 


♦** MAIN qnOY OF CALCULATE *** *** *** 
NTTFl ON THF B=>500> ALL VABlAPLF*; 

ARE AUTOMATICALLY SfT TP ZERO 
before CXECUTlnw, TF THIS 
WAS NOT AUTDHATlCf THE ARRAYS 
NUM ANO SIG WOUm BF SET 
TO ZERO HERE. 


NPTXELSlaCENnSAMP-START5AMP)/lNCS+li 
HSCANUNEiaCENpREC-STARTREr )/U'CRf I J 


FOR NRECDibSTARTRfC STEP INCR UNTIL ENDREC 


On BEGIN 
KNTl=OJ 


0ATA3 

CERTS^NRFCD# IRECNPi; 


IF NRECD NFQ IRECNn 



THEN 


ELSEI 

BEGIN 

WRlTE!LlNE*RECDERR>NRECOtIRECNn)l 
GO TO FNDINGi 
FMO 


FDR XSAMP: aSTARTSAMP STEP U'CS UNTIL ENOSAMP 


no BEGIN 


assignm; 


NUMCNSI »=NUH[MS J + U 
NSSIKnT] l«NS; 
KNTi=KNT+ 1 t 
END! 


WRITE! LTNE»NFATuYfFOR JJt*0 STFP 1 
UNTTt MPTXELS-i DO NSStJj])* 
WRITE! CTAPFr^NPiVELS^NSsr*! ) t 
END! 


RFwI\'0!CTAplT 

L.rirK!ERTS); 


ctapfheadi 


END OF CAICULATF? 


StfiSS 
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ii3y2ats€ 


PROCEDURr 


0ATA3(FILENAME#NRECD* IRECNO)! 


THE Procedure data 

INPUT data tape WI 
16 BITS nr EACH SC 
IRECNQ. THIS VAUU 

calculate where it 

THE value of NRECD 
CONTAINS THE ROLL 
IS SIMPLY STORED ! 

IN EACH SCAN LINE 
spectral SIGNATURE-^-. 

DATA IS UNPACKED AnD STORED IN THE 


3 READS SCAN LINE 
TH IDENTIFTE 

AN LINE CONTAINS 
E IS PASSED BACK 
IS COMPARED (And 
, THE SECOND 16 
parameter, THIS 
N THE ARRAY WORKI 
CONSISTS OF 8-0IT 
s OF each PIX^L I 


NUMBER NRECD FROM^AN, 

R FILENAME, THE^FIRST 
THE RECORD NUMBER ^ 

TO THE MAIN BODY OF 
SHOULD AGREE)^W1TH^ _ 
BITS OF EACH SCAN LINE 

BYTES REPRESENTING THE 
N the scan line, THIS 
2-nlMENSlONAL A^RAY IDUM, 


integer 

% 

INTEGER 

% 


filename; 

IRECNOI 


X 


nrecoj 


THE FILE IDENTIFIER FOR THE DATA TAPE. 
THE scan line or RECORD NUMBER AS 
READ FROM THE INPUT DATA TAPE. 

AN INDEX VARIABLE CORRESPONDING TO 
A SCAN LINE OR RECORD NUMBER, 


BEGIN 


ntegI 

NTE6ER 

nteger array 


K| % 

KIJ % 

temp; % 

HORKtOtni % 


sg mn 

A temporary storagi 

A TEMPORARY STORAGI 


VARIABLE, 

AREA, 




MAIN BODY OF OATA3 


MR»aNRECD-NROLD-i; 
‘^PACE (FILENAME^MR)J 


READ (FILENAME»NHRDa8* IDATMD; 


FOR Kl=0 STEP 1 UNTIL 1 


Dn REPLACE POlNTERCWORKr*3/8)+4+6*K 
BY POlNTERt IDATC*],8)+2 kK FDR ?l 


IRECNOJ sWORKCOj; 
IROLLP* =WORKC n; 


ROLL parameter not USED 


FOR KS=0 STEP 1 UNTIL NCHAN-l 


00 BEGIN 

TEMP«= CNSAMP TIMES K) + Aj 


FOR K1J=0 STEP 1 UNTIL NSAMP-l 


DO REPLACE PO I NTER C I DUME K j * 3 ^8 ^ + 5 + 6 kK 1 
BY POINTERC iDATf *3 j»8)+TEMP + k: FOR II 


end; 


NROLD*=NRECDI 


END OF DATA3I 
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PROCEDURE 


assignh; 


X 

REAL 

INTEGER 

LABEL 

X 

X 


AWAY» 

X 


THE PROCEDURE ASSIGNH ASSIGNS A PIXEL TO CLUSTER NUMBER 
USING THE CORRELATION CLUSTERING ALGORITHM DESCRIBED IN 
SECTION 3 OF this REPORT, IT USES A WEIGHTING FUNCTION 
OF the type SHOWN IN FiG, 3 ‘ 

function for> at most, the 


NS» 


BEGIN 

c; 

j; 


away; 


i AND COMPUTES THE CORRELATION 
NBACK MOST RECENT CLUST^S* 


X THE CORRELATION FUNCTION 
X AN INDEX VARIABLE, 


MAIN BODY OF A5SIGNH 


FDR NS*«NCLUST step 


1 UNTIL 

IF NCLU5T-N0ACK LEO 0 THEN I 
ELSE NCLUST-NBACK 


On BEGIN 

c; = o; 


FOR Jl»l step 1 UNTIL NCHAN DO 


Cl=C + WIDTH[ JI-ABSCIDUMI J-l,KSAMP-n-SlG[ J,N53 ); 


IF C GEQ CHIN 
THEN 


ELSE 


end; 


BEGIN 

FOR Jl=! STEP I UNTIL NCHAN 00 


SiGt J,NSI l•NUM[NS3/{NUH[NS]+l )xSIGt J,NS1 
+ I0UMC j-i#KSAMp-n/tNUM[NSi + n; 


GO TO away; 
END 


IF NCLUST LSS HAXCLUST 


ELSE 


Nst»o; 


THEN 


BEGIN 

nclusti«nclust+i; 

nsibnclust; 


FOR Jl*l step t UNTIL NCHAN DO 


SIG[ J,NS1 i=IOHM[ J-1,KSAMP*1 i; 


END 


END OF ASSIGNH; 
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PROCEDURE 


ctapehead; 

THE PROCEDURE CTAPEHEAD IS USED TO WRITE THE riRST THREE 
RECORDS ON THE OUTPUT CLUSTER TAPE CTAPE, IT THEN COPIES 

THE cluster numbers NS5[*1 THAT 

PIXEL IN calculate PROM A DISK 


^HAVE BEEN ASSIGNED TO EACH 
PILE ONTO THE OUTPUT TAPE, 


integer 

% 


BEGIN 

U 


X AN INDEX VARIABLE# 


MAIN BODY nr CTAPEHEAD 


CTlE2I*=NCHANj 
CTH3] icNSAMPi 
CTl Nl laSTARTREC; 
CT1£51 loENDRECi 
CT1[6]I=INCRJ 
CTltTJIsSTARTSAMP; 
CTU6] icENDSAMPi 
CTl [9] t=INCS; 

cTino] i=nclust; 

CTl [ n ] tsNBACK; 
CTlfAOjt=MAXCLUSTI 
CTl [All (aCMINJ 



FOR I»«12 STEP 1 UNTIL U + NCHAN DO 


CTltnieWlDTHn-lt ]; 

WRITE(CTAPE#5l#CTl[*ni 


FOR lifl STEP 1 until^nchan on_ 


WRltECCTAPEpNCLUSt + LsiGn#*nJ 

WRITF.(CTAPE#NCLUST+I,NUMt*n/ 


FOR NRECD*=1 STEP 1 UNTIL NSCANLlNE DO 


BEGIN 

ENDI 

PEWINDCCTAPEJ^ 

1 END 

OF CTAPEHEAD! 




page B 
OP POOH QUALiry 
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APPENDIX B 


ALGOL Listing of GROUPL* 
Including Procedures 


HEADIN 

GROUPXMAIN 

READOUT 

GROUNDTRUTH 

COSTMATRIX 

POTENTIAL 

PTRAIN 

PTEST 

TAPEOUTPUT 

TRUTHMAP 

SAMPNOS 

PLOT 

TEST 


*GROUPL is a verson of GROUPX that uses the procedure 
POTENTIAL for classification. 
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PRODUCES OUTPUT CL ASS I E I CAT I ON TAPE 
FROM INPUT cluster TAPE 


z 


PILE 

riLE 

FILE 

rii.E 

file 

s 

% 

z 

REAL 


begin 

IN CTAPE 2C2»900); 

OUT 0U?APE^2(?ii6o?SAV^^99); 

OUT t-lNE PRINTC2M7); 

OUT DISC DISKmiO] ( 7M 30# SA VEs99 ) I 

GLOSS ARV*Or* GLOBAL*!^ 


Integer array 

REAL array 
% 


integer 

% 

% 

Integer 

% 


integer 

integer 


NTEgER 


integer 

% 

integer 

?! 

s 

* 

integer 

I 

integer 

integer 

Integer 

Integer 

% 

integer 

z 

integer 

integer 

integer 

integer 


C M I N ; 

CTHOiSOn 
STGtOi 12#0 

I NCR; 

I N c s ; 


j; 

K1 J 

K2J 

MATOT ; 

maxclust; 


nback; 


NCHANJ 

nclust; 

npixend; 

Npi ; 

NR2J 

nrecend; 
NRot d; 

ns; 

NSANP; 


% THE correlation THRESHOLD. 

IF A correlation C IS COMPUTED FOR A 
GIVEN CLUSTER# AND IF C GFQ CMlN# THEN 
^ THE^EJXEL IS ASSIGNEQ TO THAT CLUSTER, _ 
% AN array CONTAINING 51 WORDS FOR STORING 
HEADER INFORMATION AS DEFINED IN FlG, 5. 
l200]| X SiGtJjNS] IS A two-dimensional 
AR i?AY CONTAINING THE AVERAGE SPECTRAL 
SIGNATURES ASSOCIATED WITH EACH CLUSTFR, 
THE ROWS OF SIG CORRESPOND TO THE 
SPECTRAL CHANNELS AnO THE COLUMNS OF 
SIG CORRESPOND TO THE 
* THE increment USED IN 
FROM THE INPUT TAPE. 

SCAN LINES ARE READ, 
other scan lime is READ 
THE Increment used in p 


cluster number, 
reading scan LINES 

IF INCRttl# ALL 
IF INCR=2. EVERY 

J p F* T C 

ROCESSING sample 


INE. IF INCS»1# 

LINE IS PROCESSED, 
IS 


PIXEL 


LINE 


numbers in EACH SCAN Ll 

EVERY PIXEL IN THE SCAN 

JF lNCSc2# EVERY OTHER PIXEL 

processed# etc. 

% AM index variable, 

2 the Initial sample number or - . __ 

NUMBER TO BE PROCESSED IN EACH SCAN LINE, 
% THE FINAL SAMPLE NUMBER OR PIXEL 
number TO BE PROCESSED IN EACH SCAN 
Z THE number of CLASSES FOR WHICH 

ground truth IS being used, 

■% tME maximum nUKEIeR of CLUSTERS ALLOWED, 

IF THE PROGRAM TRIES TO CREATE MORE 
THAN MAxCLUST clusters THF PIXEL IS 
assigned to am »nTHER" CATEGORY BY 

setting NS=0. 

% THE number OF CLUSTERS FOR WHICH A 
CORRELATION IS COMPUTED BEFORE A NEW 
CLUSTER IS created, IF NBACK EXCEEDS 
THE CURRENT NUMBER OF CLUSTERS# ThEN 
ALL CLUSTERS ARE CHECKED. 

% NUMBER nr spectral channels dn tape, 

k THE NUMBER OF CLUSTERS THAT HAVE 
BEEN CREATED, 

a THE NUMBER OF PIXELS PER SCAN LINE 
DM CTAPE, 

% THE INITIAL SCAN LINE OR RECORD NUMBER 
TOREREAD FROM THE INPUT DATA TAPE. 

% THE FINAL SCAN LINE OR RECORD NUMBER 
TO BE READ FROM THE INPUT TAPE, 

% THE NUMBER OF SCAN LINES ON CTAPE, 

% THE LAST RECORD NUMBER TO HAVE BEEN 
READ FROM CTAPE, 

X CLUSTER number, 

% number of samples (PIXELS) IN EACH 
scan LINE ON the TAPE, 


X 

% 


pr5cedures*h|adin*^ 


ARE INSERTED HERE 


MAIN PROGRAM 

headin; 

™ grdupxmain; 

END OF GROUPL, 


OMGINAi; PAGE B 
POOR QUALHY 


jv;>e 


PRnCtUURE HEAl:L^> 

THE PRUCLOURE HEADTN TS USED TO READ THE FIRST TwO 

rfcorDs From thf input tape ctapEi the tmForhatidn 

CONTAINED IN THE FIRST RECORD OF CTAPE IS USED FOR 
niMFW«.I ONIKl arrays IN QROUPXMAlNf 


% 

X 


BEtiTN 


*** ** 


READ FIRST TWO RECORDS OF CLUSTER TAPE 


1 

! 


REAnCCTAPE»51,CTir*n; 


NCHANI =CT1 [? 3 ; 
NSANPlsCTl [ 31 I 
NRl :=CTU43; 
NR/?;=CTU5] J 
lMCR:j=CTH 6 n 
K1 .‘sCTUni 
k?:=CTH83 J 
INCS1P.CTU9 3J 
NCUISTtsCTltlOTT 
NhACK I =CTU n 1 J 
MAXrUlST l = Cl 1 r AO ] J 
CMlNlaCTHAl i; 
NRULDJ=03 


NRECEN 0 lB(NR 2 -MRn/INCR + i; 
NPIXENfit = CK?-K1 )/IHCS + l) 




P 1 UNML NCKAN 




irAI) (HTTPE* NCLUST+r/siGt j 

RFAn(CARDSA<I!>>^MAT(JT J; 


I 





I END DE HEAPlNi 


^<&s? 
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PHru,rnuRF r.Hnuf i n t 


THK pRtjrtnyfft ^iRIUJPX^tftlN F’HttPUCFS THv ll'lTPUT 
CLASSTt- IC aT I )IJ T/ipf utiTflPF FROM THF IsjP^lT CLUSTER 
TflPF fiflPF iSlNf*' tmf Fin 1 owing steps* 
n THi !-w.i ILimiMF fiWUiiMllTHUTH UStS GHrMJNO TROTH 
lNriiR»^ATlnN lu rnigjUNf T IfJN with the UATA on the 
C lUSTFw TAPE In HiJiMUirE A COSTHATPIX MATNfNS^'MTl THAT 
contains THF nu^BFR QF PIxElS KNOWN TO RE OF CLASS 
hat that have GFKn ASSIGNED T|] CLUSTER NUMBER NS. 

TIiL PROCFilURE rUSTMATRIX USES THE INFORMATION 
Tm ThT CftSTHATRTX mATmCnS^hATI TO FSTTMATF ThF A PflSTFRlORl 
PP(iHAB) L I T IFS [)" Cl lister NB hELONGINL, TO CLASS MAT, 

IF Enough griiunu irutm uata is available ano the 
A pnsTLHinRt prorabiltiy is sufficiently high# ail nr 
THE PIXELS JN cluster NS ARE ASSIQUeO TO CLASS MAT, 

31 those CLltSTFRS THAr WERE UNABLE TO RE CLASSIFIFO 
HY THE tOSTMAlPlX ARE CIASSJEIED USING THF MFTHOO OF 
POTFnTIaI.S GT IMF PROrEfjUftF potential. 

^ 1 ) THE RESUiriMG -“L ASSU ICATION Of EACH PlXFf Is WRITTEN 
r)N THE OlITPUT TAPF OUTAPF USING THF PRMCErMjHE TApFOtiTPuT , 

4 F G t N 

GLO'^SARY of VAHlARLrS lOCAl. TO GHOIJPXMAIN 

and GLORAl TO UFAmtilT, GR OiJND TRU I H t mSIMATRIX, 

PTTFNTTAL>rAPF)UTPUT,THillHMAPfANU fFSI. 

r»,**»*»#**»*****»»****^** 

X A parameter used IN THE OEFINITION OF 

X A^lnOL^^^UARI ARlPuSF6 to PRTNT HAP 
INREcFNO.O jnPIxFNO-1 3 J % AN ARRAY CONTAINING THE 
^LASS numbers assigned TO EACH PIXEL 
■<ATOT HinO>niA 31 XAN ARRAY TO STORE GRO'InO TrUTH 
INFORMATION FOR USE IN PROCFUURF TRUTHMAP 
S AN INDEX VARIABLE. 

S AN INDEX VARIABLE CORRESPOnO 1 NG TO THE 
CHANNEL NUMBER. 

) J NP ! XF nD-I I ! % AM ARRAY WRITTEN ON OlJTApF 

containing ihf class *!UMnrR5 That have 

HELM ASSIGNED TO EACH PIXFl. IN A 
SCAN LINE, 

1 THF weighting factor dSFD IN MOIMFYInG 

The potenful functiom in each error 

CORRECT, INC, I TF RAT ION, 

Y THE HtCiiNNiNG SCAN L 1 ME NUMBER (UN 
ORIGINAL DATA tAPEJ IN A GROUND TRUTH 
ARFA. 

% The fndIng scaim i.Inf nuwhfr (On 

ORIGINAL UAIA lAPE) IN A GROllNn TRI.'TH 

area, 

X an IHOF.X variable CORRESPONOiNG TO 

the Class numpfr, 

:N^LUST.[i sHATiIT i;s MAlNrNSfHATI TS AN ARRAY 
CONTAINING THE NOMaER OF PIXELS KNOWN 
TO RL jF CLASS mat that HAVF RFFN 
ASST'JNFO TO cluster NMMPtP NS. 

X IF the percentage HERCtSlNSl 15 LESS 
ThAm MlNPrR for Any CMi.slFR NS. them 
CLASSIFY tors CLltSTFR WITH POTENTIAL. 

% IF THE number up PlliELS wITHlN A CLUSTER 
FOR which tjRmiHil TRUTH EXISTS IS LESS 
THAN «iNTnr Then classify this 
"L iKTER with potential, 
y the wumblr of rf cohos to of skippfu in 

'IROFR T!f READ .-.CAN LINE NUMBER NRECl. 

1 THE number of data cards CONTAINING 

GRU'INIJ TRUTH iHEORMATinN ABOUT A 
M V F 1 Class, 

MATGTi; X AU array FoR STURInG the number of 
G sU'i'U) Troth caros fur usr in triithnap 
% the beginning PIXF.L NttMBLP ON ClAPE 
C’PPF SPliNDl NG ID A GHUUNU TRUTH APEA, 

J, AN INDEX VARIAfOF C ORR E S P O N 0 I N C. ID 
A PIXFL NL'-'BFr. 

X THF Ending PlXLl mUMHFR UN CTApF 

rCiERF sF’UNOImC, to A ground TRIJTh aria. 


r 



RFAL 


ALT AT 

X 

RDOlFAN 


BHL.F 

1 NTFc.lR 

array 

"L*SStO 

* 

INTEGER 

array 

c.uMi) r u 1 

integer 


IT 

Integer 


j* 

X 

TnTFGF r 

1. RL A V 

/ 1 u, s S r 

t 

r 

r 

RFAL 


LAMOAJ 

t 

i 

1 mTEGER 


i.r; 

X 

X 

T'!TF-,FR 


F } 

i 

IN TE(,ER 


. 1 A r J 

X 

integer 

ARRAY 

M A 1 N l f) 

X 

X 

X 

INTE GEK 


M I :.PLR 

X 

Integer 


mthtut. 

X 

r. 

z 

INTEGER 


wr; 

Y 

Integer 


M" .,RDS -' 

y 

X 

INTFGLR 

array 

'■'CPuSIOi 

r 

TNTFC.f R 


N^ 1 X 1 f 

/ 

I N T E L H 


N P 1 T 

X 

I HTf (i£R 


NP i 

V 

tntf gf r 


' ^ r ^ I 


AN ' 
PEC 


;OF X VARUbl.F 
'PO NllMGFR. 


CCiRRF SE’nND I NG TO f, 


DBIGINAL PAGE IS 
OF POOR QUALrry 
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tntfgfr 

% 

T ig T t fi t R 

?K|TEGFP 

7. 

% 

TnTFGFR 

* 

Integer 

% 

% 

ARRAY 

% 

% 

Integer 

% 

Integer 

integer 

% 

% 


ARRAY 

ARRAY 

ARRAY 


NPLrlJ * THE BEGINNING RECOKn NUMBER ON CTAPE 

CORRESPONDING TO A GROUND TRUTH AREA. 

N!RtC?J % THE ending RECORD NUMBER ON CTAPE 

CORRESPONDING TO A GROUND TRUTH AREA, 

NSHi % fHt 9EGINNING SAMPLE (OR PIXEL) NUMBER (ON 

ORIGINAL DATA TAPE) IN A GROUND TRUTH 
AREA, 

NSEJ % THE ENDING SAMPLE (OR PIXEL) NUMBER (ON 

ORIGINAL DATA TAPE) IN A GROUND TRUTH 
ARE A • 

NUMfO iNCUJST H % AN ARRAY FOR STORING THE NUMBER 
OE PIXELS THAT HAVE BEEN ASSIGNED 
TO EACH CLUSTER! 

pehcls[0;nclust3I5! perclscnsi is the maximum 

percentage Of PIXELS IN CLUSTER NS 
THAT BELONG TO ONE CLASS. 

PtXF LS[OtNPlXFNO-t j;% AN ARRAY READ FROM CTAPE 

containing the CLUSTER NUMBERS ASSOCIATED 
with EACH PIXEL IN A SCAN LINE. 

PIXTOTj % THE TOTAL NUMBER OF PIXELS PROCESSED, 

TRWNSPL OiNCLUST l;x TRANSPtNSl IS AN ARRAY CONTAINING 
THE CLASS NUMBER MAT TO WHICH CLUSTER 
NUMBER NS has been ASSIGNED, 


X 


7 


PROCEDURES HEADOUT, GROUNOTRuTH# 
TAPEOtlTPUT,TRUTHMAP,ANO TEST ARE 


CDSTMATRIX# POTENTI AL» 
INSERTED HERE. 


MAIN BODY OF GROUPXMAIN 
HE ADOUTI 

” groundtruthi 

’ R> A oTcTpd S*<I5>,MTNT0T)J 
PFAD(CARDS# <I5>^MINPER}) 

READ( C ARDS/<2F 10. A># LAMD AZALEA )j 

WPITECUNE#<X5»”MTNT0T=*’^ l6>#MtNTQT)I 

HRITECLINEf <XS. ’*MTNPER = "M5>#MTNPER)I 
tJRITL(LlNE^<X5,"LAMDA= 'S F7 , 2>. LAMDA ) I 
WR1TE(LINE?<X5^”AIFA= 'SET.Z^^ALFA); 

CnSTMATEIX? 

POTENTI All 

TAPEOUTPUT j 

! ROlisTRUEJ " 

I TRUTHMAP(NR1»NR2# !N CR# K1^K2^ I NC NRECEND* NP I XEnD# BQL ) J 


TRUTHMAP(NR1>NR2,TNCR*K1 >K2» 1 NC S* NRECEND/ NP I XEND ^ BOL) I 

If NR2^ TNCR^ K1#K2^ I NCSf NRE C FNDf NP I XEND# BOL ) i i 
I _ LOCK (PISC)I J 

END OF GROLIPXMAINI i 



ORiGIHAlj PAGE m 
OF POOR QGALITir 
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PRDCrDURE KEArnuTJ 

THt HfADnur reads the third record of ctape 

AMD WRITES the IMFORHATTON FROM THE FIRST THREE 
RFCnRDb UF CTAPE ON THE LINE PRINTER, 


LIST 


r^iRM A r 


r 


y. 


^^'^'^LlCFOR I* = 0 STEP 1 UNTIL H DO C T 1 [ U » NRECE ND» NPI XEND ) » 

L^JCFOR U = X2 S.IEP 1 UNT_LUXl4:i^Hft.N DO tl -11 1 CT U H I ) # 
LTcfOR Ts-j STFP 1 UNTIL NCLU!^ pD 

[ I#NUH[ n#MCMAN»FOR Jl«sl STEP 1 UNTIL NCHAN DO 

f 1 01 (Xsl »f APE 2A6»X5» »*NCHANs»'# I5#X5# <«NSAMPc«'# 15/ 


X5,*‘NR!»*’^ 15#X5/''NR28«> I5f X5 p"INCR = 'M5/ 
X5#"Kl6t"M5>X5#«K2fc'*#15/X5^ ''INCS = '*#I5/ _ 

X5#_’*N^LUST = *V I5#_X5>"NBACK = "> 15^ X5> »NRECFNP=”» 15> 
X5>"NPIXEMDa»* I5)» 
n02CIl Of Fto, 1 
F103C 17^ I 10^ X6,*I6H 


RFAD THIRD RFCORP OF CLUSTER TA RE 

REAPCCTAPE^N£lU5T+ljNUH[*n; _ 

WRITE OUTPUT FROM CLUSTER TAPE ^ 

WRITE( LINE/FIOI ^L1 )> 

WRITE( DISC, FiOi , LIT J 

WRITEC LlNE,<Xb#«MAXCLUST = "f I5>» MAXCLUST) | . 

HfUTEtLiNE* <X5 ,"CMINo ”, F7 ,_2>, C M I N } T j 

WRlTEt LINE,«/X5,”MAT0T*”, I5>,MAT0T)T 

WRITEaj;NE,<//X9^”I”,XA,”0ELTAf | 

WRITE(LTNE,F102,L2)J , ! 

WPITE( LTNE,<X5,”NS”,X5,”NUMINS5”,X8> 

"VALUES 0F_ SlatNSjJl">)I 

WRITE CL T NE,F103,L3); 

I«-0 step 1 UNTIL NCLUST DO 

PrXTOTiePiyTOTfNUMC ni j 

WHlTEaiNE,<X5, "THE TOTAL NUMBER OF P I XELS = *S 1 6>, PI XTOT ) ;j 

WRITECl INF»</X5,"THE NOMBER OF UNCLASSIFIED PIXFLS«", j 

I5/>,NUMtO ] >; I 

i END OF HEADOUTI | 

_J 
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[’wOCEDURE GR^U^lftThUTH; 


I TST 
FnRMAT 


STPPF iMFORMATIriN IN CUSTMATRIx HATNtNS#MATJ, 

READ GRnOND TRUTH aNR COUNT PIXEL ASSIGNMENTS 

9FGIN 

LAILB>U#NSP*NSE ) t 
n04C415); 


FQR HATti:! STEP 1 UNTIL KATOT DO 


BEGIN 

PEAn(CARDS,<I5>/NCAR0S)I 


WRIU(LU'E»</X5,»THF F0LLGWING»W4, 

” GROUND truth SITES CONTAIN CLASS NUMBER" » I 3/> > 

NCARDS# MAT ) I 

WRITE! LINE# <xa» " beg I MNI MG"* X 2f "ENDING"* X3f 

"BeginninG",x2,"enoing"/X4*"scan line"* 

Xl*"5CAN LINE"* X2* "ELEMENT "* X3* "ELEMENT ">) I 


FOR Iii’l STEP 1 UNTIL NCAROS 130 


BEGIN 

READ!CARDS*F10A*L4)I 

wR1TECLINE*<4H0>*lA3) 


LBi=IF lb LSS NRl THEN NRl ELSE 
LEJsIF LE GTR NR2 THEN NR2 ELSE 
NSB. LS5 Ki THEN Kl ELSE 


LBI 

LEl 

NSBI 

NSFI 


NSEt=lF NSE GTR K2 THEN K2 ELSE 
NREC 1 I = INTEGER t (LB-NRl ) /INCR + l ) J 
NREC2l=ENTIER!aE-NRl)/lNCR+lJJ 
NPIXl t=!NTEGERC(NS0-Kl )/INCS+l 3 J 
NPlX2lcENTIERCfNSE-Kl)/lNCS+133 
GRNO[ MAT* I » 1 I ;=L0> 

GRND[MAT#I*?]»*LEJ 
GRNOf MAT* 1*3 J i»NS8J 
GRNDfMAT* I*a] i=NSF3 
NCROStMAT ] ! =NCAROSi 

MR!sNRECl-NRQLO-ii 

SPACF(CTAP£*MR)| 

FOR NRECt=NRECl STEP 1 UNTIL NRFC? DC) 


1— 


BEGIN 

READ(CTAPE*NPIXEN0*PIXELS[*])3 


FUR NPIXI=NPIX1 STEP I UNTIL NPIX? 00 


MATNCPI XFLStNPl X«»l J*MAT3 

t =MATNtPlXELS[NPI X-i 3*MAT3+W 


ENOf 

I I- - 

! nrqlo!=nrecp; 

j L END J 

ENOJ 


. .J 


jEND OF GROUNOTRUTHI 


] 


jORIGINAIi PAGE IS 
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PIXELS IN 
GROUND TRUTH 


procedure COSTMATRlXt 
If 

X PRINTS 

X MATNINSj 

1 To'^PELQnG"TO MATERIAL NUMBER MAT, , , 

X FOR EACH ROW NSo . TR ANSPF NS 1 IS_EyUAL TO JHE COLUMN 

% - - - ^ 

2 

% TO material number MAT. 


S OUT the COSTHATRIX MATNtNSjMAn. 
ns^MATI is. equal TO THE NUMBER QF 
FR number ns that ARE KNOWN FROM G 


FOR EACH k! 1« N5<> lU inr- i-'tuuM* 

number mat containing the largest value of ‘^A ^NCNS,MAT ] , 
THIS MEANS THAT CLUSTER NUHBEF NS HAS BEEN ASSIGNED 


BEGIN 

r,L 0 SSARY* 0 F**ARlARLES*LnCAL*T 0 *C 05 TMATRlX, 


•tt ** 


% HEADING OF the COSTMATRIX, 

Integer array KnuNTtoiMAToni * an array containing the integers 


INTEGER 

% 

ARRAY 


REAL 

* 

% 

integer 

% 

% 

ARRAY 

% 

% 

REAL 

array 

% 

array 

% 

% 

% 

REAL 

FORMAT 


I TO MATOT FOR PRINTING ON THE 
MATNAX; X THE MAXIMUM ENTRY IN A GIVEN ROW OF THE 
COSTMATRIX MATNENS-MAT],, 

PERHITCOIMATOTJJX the percentage of pixels THAT ARE 

actually of class mat that have been 

CLASSIFiEO AS BELONGING TO MAT 8Y 
THE COSTMATRIXs 

THE average percent CORRECT CLASSIFICATION 
§?*‘COsVi*AffelX®^* CLUSTERS CLASSIFIED 

THE NyHBER_gf’SPACES TO SKIP.JJN PRINTING 


PERTOTI 


SKIP! 


SUKtOlMATOTlI* 


OUT THE COSTMATRIX FOR 
number of CLASSESj 
THE number OF PIXELS 


A variable 


NUMBER OF PIXELS OF A GIVEN CLASS 
THAT HAVE BEEN ASSIGNED TO THAT 
CLASS B-y THE COSTMATRIX, 

TOTALI 3! THE SUM OF TOTNStMATl OVER ALL CLASSES, 

TOTMCOlNCtUSniX AN ARRAY CONTAINING THE SUM OF EACH 
ROW IN THE COSTMATRIX MA TN f NS# MAT ] , 

tdtnscoimatot m an array containing the sum of 
EACH COLUMN IN THE COSTMATRIX 
MATNtNStMATl# EXCLUDING THOSE CLUSTERS 
TO BE CLASSIFIED BY POTENTIAL, 

TOTSUtH * T^4E SUM Of SUM! MAT] OVER ALL CLASSES, 

F1020CX3P "CLUSTER/CLASS"# *16# X*# "TOTAL"# X5# "CLASS"# X3# 

"PERCENT"}# 

X9#*I6#X*#F10,0# I10#F10,2}# 
'F6i0#X*#F1OjO}# 

*F6oO#X*#FlO,0)# 
n 024< /XO# "PERCENT" #X 6 ff*F 6 9 1# X^jFlO, 2 )/ 


FtO22</X5#"TOrAL"#X7#*F^,0#X*#Floio5# 
F 1 023 "CORRECT"# X6#*F6 1 


FORMAT 


F 1 025 C/X4# "PERCENT "# X6# *F6, 1# FI 0,2) I 
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MAIN 0ODY CF COSTMAtRIX 

FIND the maximum E^TRY IN EACH ROW OE' MATNlNS^MATl 
AND SFT THE CCRRE^PONn I nG VALUE OF MAT EOOAL^TD _ 
TRAMSP[NS3i the RATin OF THIS MAXIMUM NUMBFR TO THE 
TOTAL UF EACH ROW* STORED AS A PERCENTAGE IN PFRCLSlNSIf 
IS A MEASLIRE OF HOW GOOD THf CLUSTER IS FROM THE PniNT 
□F VIEW OF CONTAINING PIXELS QF ONLY ONE CLASS. , 

ONLY CLASSIFY THOSE CLUSTERS THAT HAVE AT LEAST mINTOT 
GROL'MO TRUTH PIXELS IN THTM* THE REMAINING CLUSTERS 
WILI Pi CLASSIFIED BY POTENTIAL. 


FOR NS:=1 STEP 1 UNTIL NCLUST DO 
Tegin 

MATMAXiaOl 

FUR MAItal STEP ,1. UNtiL MA TQT DO 
BEGIN 

IF MATN rNS>MAn QTR MATMAX 
THEN 


... J 


else; 


BEGIN i 

matmaxi=matn(ns.matt; j 
transpi NS 3 : -hat; ! 

ENO 


TOTMINS J : =TnTM[NS]+MATN[NS»MAT3; 
end; 


IF TOTMTNS] LSS MINTOT 


; I 

then ' ' ' 1 

; i 

TRANSPCNS3 1 = 
PERCLbCNS3*»0 

FLSE 

f 

PiRCLSlNSIis 
MATKAX/TDTMINS 3x100? 


,FN0; j 

1 1. 
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VCj<; 


COSTMATRIX (CONT,) 

FOR FACH CLASS MAT# THE PERCENTAGE CORRECT 

CLASSIFICATION AS MEASURED BY THE NUMBER OF GROUND 

truth pixels in THOSE CLUSTERS ASSIGNED 

DIVIDED BY THE TOTAL NUMBER OF PIXELS IN ALL CLUSTERS 

THAT ARE- known TO BELONG TO CLASS MAT# IS STORED 

IN the array PERHITTHAT]. 

the overall correct CLASSIFICATION FOR ALL 

CLASSES 'S GIVEN BY PERTOT. 
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COSTMATRIX (COM, ) 


WRITE OUT the COSTMATRIX AND ALL TOTALS, 


HRITE(LINEERA6E3 )1 


FOR MATia 1 ST E P 1 UN TIL M^TOTOO 

KOUnTCMAT3i°MATJ 

SKiPlaC 10«MAT0T)K6+1J 

WRITE(LINEiFi020#MATDT# FOR MAT*»l STEP 1 UNTIL MATQT 
DO RoUNTEMAf] #SKIP»SH 

FOR NS*ol STEP I UNTIL NCLUST DO 


HRlTECLINE^F102UNS#MAT0Tf FOR HATl = l STEP I UNTIL 
MATOT DO MATNCNS#MAT]#SKlP»TOTHCNS]^TRANSPtNS]# 
PERCLSEN S))J 

WRITE(L!NE*FI022#MAT0T#F0R MATi»i STEP 1 UNTIL MATOT 

DO TOTNStMAT WSKIP£T0TAL)^ 

WRITEtLINE>FlG23»KATOT#raR MATjai STEP 1 UNTIL MATOT 


DO SUMtMAT3#SKIP*TOTSyM); , , 

WRI TECLINE# F I 02ApHATDT# FOR MATi=l 
DO PERHIT[MAT].SKIP#PERTOT) j 
WRITE(0ISC#Fi025#MAT0T/F0R MAT i«I 


DO PERHIT[MAT3#PERT0T)I 


END OF COSTMATRIXI 


FOR MATi=1 step 1 UNTIL MATOT 
PERTOT)j 

FOR MATioI STEP I UNTIL MATOT 


PRnCFDL'RE POTENTIALS 


PPOCEDUPE POTENTIAL SORTS THE SPECTRAL SIGNATURES 
HSSOClATEn WITH EACH CLUSTER INTO TWO GROUPS. JNE 
SIGNATURES OF CLUSTERS THAT HERE CLASSIFIED BY COSTMATRIX 

M stureu in the array xn^j^Kj and are used for^ 

TRITNING THE POTENTIAL CLUsrhER Tn THE PROCFDURE 
PTRAIN. THE SIGNATURES OF CLUSTERS THAT WERE NOT 
CLASSIFIED BY COSTMATRiX ARE STORED IN THE ARRAY 
YIJ^KT AND Are then classified in THE PROCEOURE PTEST. 


BEGIN 


glossary of 

*** *-h 


variables' LOCAL To ‘PUTTWriALi 


INTEGER ARRAY COUNT [ 0 IMATOT# 0 INCLUST ] J % COKNTLIfJJ IS THE NUMBER 

FUNCTION 


% 

% 

X 

X 

ARRAY 

X 

REAL 

% 

integer 

X 

?ntec.er 

X 

integer 

% 

integer 

% 

INTEGER 

X 

Integer 

j: 

S' 

INTEGER 

% 

integer 

% 

integer 

X 

integer 

% 

% 

integer 

% 

PEAL 

% 

% 

X 

array 

X 

% 

ARRAY 

% 

% 

integer 

% 

X 


ARRAY 


GfOiMATOniX 
GMAX; t 
IFLAG) % 


I i 

IGMAX; 

jy 

KJ 


X 

% 

% 

% 


OF TIMES THAT THE.POJENT I AL , ^ 

AUGMENTED^BY^LiiMDA IN ORDER TO^CORRECTLY 

^h^;TkfSfN'I^SE'^"^l?6IMINANT 

function for each CLASS. 

THE MAXIMUM VALUE UF THE DISCRIMINANT 
FUNCTION. 

A flag TO determine WHEN ALL TRAINING 
SAMPLES ARE CORRECTLY CLASSIFIED BY 

AN'^f5|JEx''0ARTABLE CORRESPONDING TP 
A CLASS NUMBER. 

THE NUMBER OF THE CLASS WITH THE LARGEST 
DISCRIMINANT function, ^ , 

AN TNDEX VARIABLE CORRESPONDING TO THE 
CHANNEL NUMBER, 

AN INDEX VARIABLE CORRESFONOlNG TO 

A sample KUM_BER 


ARRAY KEEPIOJNCLUSTII % An ARRAY 


[TAlNiNG THE 


CLysT|R 


KSWJ X 

kt; * 

u % 

ARRAY N[OrMATOT]JX 

NSAMPKI % 

sum; % 


NUMBERS FOR eS? 3 CLUSTER THAT Is TO BE 
classified using the method UF POTENTIALS. 
A COUNTER TO LIMIT THE TOTAL NUMBER 
Of ITERATIONS iH PTRAIN TO fOO. 

AN INDEX VARIABLE CORRESPONDING TD 

A sample number 

AN INDEX VAgiABLC CORRESPONDING TO 
A CLASS NUMBER. 

Nfll IS an ARRAY CONTAINING THE NUMBER 

nr samples labeled class i that are 

THE DISTANCE IN FEATURE 
SPACE between a cluster training SAMPLE 

and a cluster sample to be classified 


IS THF 
K TH SAMPLE 


BY potential. 

XrO!HATQT^O;NCLU5T#OiNCHAN]JX X[I»J#XJ 
5PF,CTRAL signature Of THE 

YrOiNCLUS7,0 inchan*J;% ^an^array containing the spectral 
signature of each CLUSTER To BE 
CLASSIFIED BY POTENTIAL. 

WT [ 0 smATOT. 0 J NCLUST ] ; X weighting factot accounting 

FOR THF VARYING NUMBER PL 
pixels In each cluster. 


% 

X 

% 


procedures PTRAIN AND PTEST ARE INSERIFO HERE 
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HAIM BODY UF POfEMTIAL 


FOK lt = l STFP 1 U^JTIL MATOT t)d 


rOH NS :=1 STEP I UNTIL NCLUST DU 



I>N[ I ] # JMaSIG 


NSAMPKtaKI 

WR IJEC L I NEf <y X5# " ! X5 ^ "N £1 J"/> ) 7 


FOR I;b1 step 1 UNTIL MATOT DO 


wRITE(LINE#<I6j I10># I#M[I 1)1 


PTRAIN) 




PTtST) 




END OF potential; 





"A 





f'fjncFDuRL Hii.fliN; 



ORiGINAIi PAGE IS 
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procedure: ptest; 


THE PRCJCEDUFE PTEST USES THE DISCRIMINANT FUNCTIONS 
CALCUI.ATED in PTRAIN to classify the CLUSTERS^ WITH,^ , 
SPECTRAL^GNATURES stored in THE„ARRAY^Yr J,KT1# THAT WERE 
NDT CLASStFlTID BY THE PRDCFDDRE C05THATRIX* ■ 


REG 3 N 

- y FPR STEP 1 tlNTlL NSAM-PK DO _ 

BEGIN 

GMAXJ= 0 ; 

FOR I?-l STEP 1 until MATQT DO 

’{iii j:=pi . . , : \ 


FOR I*al STEP t UNTIL HATOT DO 



IGMAX isO 


FOR Kt = l STEP 1 UNTIL NlH DO 

BEGIN ' J 

SUM*=p; 


FOR J«=l step 1 UNTIL HCriAN DO I 

SUM|=SUM + CYIJ<TaJ1-X . ... L_- 

G[ I ] leGC IT + C C 1 + LAHOAxCOUNTC K] )/C 1+ALFAxSUM) 


WTn#Ki; 


Gt IllaGn l/Nt I 
IF.Gtn QMAX 
THEN 


OF GIN 


‘'"SASx.^Gctj; 


TRANSPI:kFEP[K.T m=tGMAXI 
END? 


end of ptestt 










pRucEnuRE rAPE:rnJTpu^^ 


THE PRUCEnURE TAPfnUfPUr PPODUCES THE OUTPUT TAPE 
OUfAPF. THE FIRST RECORD OF QUTAPF CONTAINS THE CONTE 
OF CTU*]. EACH SKCCFEDINQ RECoPD CONTAINS THE 
CLASSIFICATION OF EACH PIXEL IN A GIVEN SCAN LINF.^ 

THE ARRAY PIXELS!*! CONTAINS THE CLUSTER NUMBERS 
FOR FACH PIXEL IN A SCAN LINE ANO^THE ARRAY KLASSUI 
CONTAINS THE CORRESPONDING CLASS TO WHICH EACH PIXEL 

HAS been assigned pY FITHER COST MATR IX OR PPTFNTIAL, 

"hegin 

WRiTEC0UTAPF>51j'CTiC*l)I 
SPACECCTAPEf -NROUD); 


NTS 


FOR NREC:»! STFP 1 UNTIL NWFCEMD DO 
HEGIN 

READC CTAPE^NPIXEnD* PIXELS! *3 


FOR NPlX»=0 STEP I until NPIXEND-1 DO 


KLASSENP1X3 * 3 transpc pixels CnPIXI i; 


i 

I 


WRIT ECOUTAPF^ NPlXENDiKLASSC* 

EW 

WRITECLINE^ <XSf I5 b"SCAN LINES*EACH CONTAIN I NG">T5j 
•’PIXELS HAVE BEEN WRITTEN ON OUTPUT TAPE"># 




li. 


END PF TAPEOUTPUT; 
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% 

*■**★1(1* + **** A* 

*********** 

Procedure truthmap 

(NRJ* 

MR2» INCR?K1,K2* INCS* NRCD*KSMP>BnL)I 

INTEGER 

% 

NRi;' 


% beginning record to be 
processed 

INTEGER 

.NR2J,:, . 


% final record to bf processed 

tnteger 

incr; 


X THE scan-line INCREMENT 

integer 

Ki; 


.. .% BEGINNING SAMPLE JiUMBER 

inteofr 

KP; 


% ENDING SAMPLE NUMBER 

tmtfgfr 

INC SI 


X sample NUMBER INCREMENT 

integer 

% 

NRCOJ 


X number of records to PE 
PROCESSED 

tnteger 

ksmpT^ ~ ' 


% NJHBER OF SAMPLES TO "BE 
PPOCFSSEO 

pnOLEAN 

: BOLI 

% . A 

boolean variable^ used to print map 


BEGIN 


11 

It 

II 

11 


M* ^ OT Ito- M 

srssB3SSESSK;B = xssrss;£:a==sssr7srs*:s=s:isss; 

integer 

array scaleioj 3*oiKsmP 

Div 531% MtRiJT scale FOR THF SAMPLE 

¥ 



NUMBER AXIS 


5^ procedure SAMPNOS INSERTED HERE 


*f- procedure PLOT INSERTED HERE 


real array C0rtBHAPl^*NRCD#O»KSMP3l“^f DSED TO PRINTOUT THE SPECIFIED 


EAL array CHARCl*91I 

tmteger array PRctoisoTi ■ 
Integer array frccoisoji 
Integer array bshpeotsojj 


area, cluster numbers are 

ARE PLACED AT GROUNDTRUTH 
LOCATIONS 

% USED TO STORE 1 THRU 9 AS 
CHARACTER DATA 

■^rSCALlNG ARRAY ^ ^ 

% SCALING ARRAY 

% SCALIN&“ARRAY 


INTEGER Array ESMPI0T5O]; % SCALING ARRAY 


integer uj,K/l; ” x counters 

FORMAT IN FMT3:(X20#2l5>X5^2!b)J 


format OUT TMTSC' THIS IS THE CQMSiNEO GROUNDTRUTH MAP, «#///>; 


■*** main BODY OF GROUNblRUTH ***. ! 

IF SOL 



■ t 

THEN 



.* 

/ 'SEG.iN ■ 



A 

1 

1. 
. T 

> 


p A GP to 



*** iMtnAsi/rs iHh: *** 

rnti ji = i STFP 1 ufjriL mrcd 

^jFTirH' ■ i< r= r~s fTp ijr^t rr' ksm> 


on COMBMAPrj*K 

*** SETS Lfp' Ttir i!*ASACTF'IJ vrc^nR *** ' 

FILL 1'ITh >>t "> «4«, tigi»; 


*** GEsjERATfS THE SCALE ALOMG THE SAMPLE NUmHER AXIS *** 

SAMPMDStKr>KSMP> IN'C Sy t 

*♦* PROCESSES The Classes *** 

FOR us=l STFP~T ItMflL MATOT 


on Pi GIN 

FOR l* = j STEP 1 iif^niL "iCRnscu] nti 

r ke7,in 

rBRCin» = ir GRNOrL^IM] LSS NRl THEN NRl 
; El SF GRMDtL*I»ni 

.BRCin» = lMTEGERCCaRCm»NRn/INCR+n; 

' FKCtIJJ = IF r,RNDtL#F^?3 GTR NR2 THFW NR2 . . 

ELSF GRHOn.M#2T; 

: ERCtnSsCERCirj-^Rl) DIV iMCR+tF 

RSf-ipcijislF GRHOrirIf33 LSS Kl THEM Kl 

j - ELSE GRNDCL^I^ 33; 

^ 8 SM?t I ] :-lMTESERC< 8 SH?E I ]-Kl J/IHCS + l ); 

I ESMPnj:= IF GRMOlLfTM] GTR THFM K? 
i ■ ElSF GRMDtL#I*A]? 

: es«pe nt = (FSMP[ ij-Ku niv r.vcs+u 

__EMO.F . ■ ' . . 

♦ ** assigns The class .MUMRFR to CllMRMAP AT GRUUNf) t)ATA 
LnCATHlMS *•* 

FOR l!=l STEP 1 UNTIL MCRDSILl 

>nR_ jiseRcri] STEP i until ercui . 

; r)ij KHR iftsRSMPm STEP 1 UNTIL ESHPEI] 

■ jT)'n‘'rtFGl’V 

cnMRMAPt J,K3i=CHAR[Li; : 
i FfJDt 


4 --, - 


r E Nor ' . ■ ■ . : 

*** PRINTS OUT TnE COMRfNFt) GHnjHflTRUTH MAP *** 
WRITECLlNE'TPAAFrj;' 

_wRl T£ f I- lN £.>rM TP ) ; 

PI uT (cnM-j RAP) ; 

FL ■ ■■ 

~RES'I\ ■ — ■■■ ■■ ■ ■■ 

-rtSITECL INFrPAGE] )f . 

SA*^tM JSC KT# SS«P* IMCS ); 

PlOTCCLASSIT 

LNUI 


ENH OF IPUTHMAP; 


F 

u 

r 

I 


88 



PKDCt DURt SAMPNO'SCkI^KSMP^ U>CS)i 

TmTFGER kJ; % beginning sample number 

Integer ksmpj ■ % number of samples to be 

* PROCESSED 

Integer Twgsj % sample number increment 

BEGIN 

Integer tfmp; % tnitialized to the beginning 

» . SAMPLE NUMBER/ THEN INCREMENTED 

% TO CREATE THE SAMPTE NUMBER 

% SCALE 

INTEGER I#Ji % COUNTERS 



*********** ********^> *********** 

*** MAIN BOOT OF SAMPMDS *-** 

TemPIsKU 

FUrt I* si STEP 1 UNTI L K 5 MP pIV 5 

[do begin 

I *** CONVERTS A SAMPLE- N UMBER INTO A COLUMN VECTOR 
j SCALEEl# n J=TFMP DIV 100 > 


SCALEC2*n:=CTEMP MQD 100) OlV 10 ) 
SCALEt3»l3»=TFMp MOD 10) 

, TEHP1JSTEM.P + 5XINCS) 

END) 


-EiVD^nr _ S AMP NOS) 












PROCEDURE 


PLUUMAP) J 


INTEGER ARRAY MAP[0#0]J 

begin 


INTEGER 


NTEGER 
FORMAT OUT 


SCLJ 


W J> K» 


FMT3CX5M20AI) > 


% SPECiriEO AREA TO BE PRINTED 


:: e s; s c s 


% initialized to the INITIAL 

RECORD NUMBER^ THEN^IS^ 

incremented to generate the 

THE SCALE ALONG THE RECORD 
NUMBER AXIS 
% COUNTERS 


FORMAT OUT 


FMT4(X5#120T1); 


FORMAT OUT 


FORMAT OUT 


FMT6CXUI3)i 


FHT7(XSi25 { I TjXA J )} 




****** **** ********************* 

*** MAIN BODY O F PLOT *** 

*** PRINTS OUT THF SCALF ALONG THF SAMPLE NUMBER AXIS 
FOR Ji=l STEP 1 UNTIL 3 

[nn WRI rE(LlNE#FHT7»FDR step 1 until KSMP OIV 5 

I ■ ; . ■ on SCALE tJ^IDI 

SCLJ=NRt; 

FOR Ji=l STEP 1 UNTIL NRCO 
00 BEGIN 

*** prints OUT THE RECORD NUMBER EVERY FIFTH PASS 
IF J MOD Vifll 


THEN 



BEGIN 


WRITECLlNEtN0]^FMT6,SCL)li 
SCL«aSCLt5><lNCRf : 


FL5FJ 


I *** PRINTS OUT A SCAN*EINE *** 

j IF BOL ~ 

r THE N 


WRlTE(LrNF>FMT3^F0R K*-l STEP 1 UNTIL KSMP 
: 00 MAPI JjKl ) 


else 


WRITE(LINE^FHT4,F0R KtsO STEP 1 UNTIL KSMP'^1 


DO HAPfJpK])) 


end; 


END OF plot; 


PROCEDURE test; 

8EGIW 


INTEGER 
Integer array 

ARRAY 

% 

array 


col; 


AN INDEX variable 


ERRMATtOl lO>OllOi;9J THE CLASSIFICATION ERROR MATRIX 
PFRCORtO;lO]^3: percent of TRUE CLASS THAT ARE 

rl A^^TPTcn rnRRFPTi v 

PFRCORCOLCOIIOI; % PERCENTAGE OF PIXELS WE CALL MAT 


integer Array sumcos20i; % 
integer siiMROw; % 
INTEGER Array TnT[Ot20i; % 


^ THAT are REALLY MAI 

SUMC0S201; % THE SUM OF EACH ROW IN ERRMAT 

SliMROw; % THE SUM OF ALL ELEMENTS IN eRRMAT 

TnT[Ot20i; % THE SUM OF EACH COLUMN IN ERRMAT 

THTDlAr,; t THE SUM OF THE DIAGONAL ELEMENTS IN ERRMAT 

TOTSUM; % THE overall percent CORRECT CLASSIFICATION 


F1(/^X13j5I7jX5M5^X5»F6.?)^ 

F?C//^ XS> ’’PERCENT ”^5F7.:* X15# F6* 2 
F3C/# X5^ ’'SUM*’#X5^5I7^ XSi. I5)i 

LlCFDP I 1 = 1 step I UNTIL MATOT DO ERRHATtMAT> 5UM[MAT1> 
PERCORtMATI)# 

L2CFOR 1 1 = 1 STEP 1 UNTIL MATOT DO TOTC U# SUHROW 
L3(F0R lisl STEP 1 UNTIL mATOT DO PERCQRCOI C n>TQT 5UH); 

write(Hnf#<///>x?3,«test error matrix »>;; 

WRULf LINE *</// /# x?5# "CL A SS IF I ED" # X20# "SUM", X5» "PFRCFNT"^ 
//,X5,»ACTUAL">); 


SPACECOUTAPF,! )j 

FDR, NSt=i STEP 1 iiJ^XJL MBEC.END D U 
R£AOa)U7APF»NP!XEND#CLASSENSj*n ; 
FOR MATl = l STEP 1 UNTIL MAT O T_J) ^ 

TFgTn 

REAOCC Ards, <I5>,NC ARDS); 



FOR ll=l STFP 1 UNTIL NCARDS DO 
BEGIN. 

RE AD{CAROS^<AI5>j» LR#LEf NSB#NSE ); 

lBi = If LB LSSJURl THELN NRl ELSE 

LEa=IF LF GTR NR2 THEN NR2 ELSE 
NSBl=IF NSB LSS K1 then Kl ELSE 
NSElxIF NSE GTR K2 THEN K2 ELSE 
NRECl I = INTEGERCCUB-NRU/INCR + 1 )J 
NREC?S=ENTlER((LE-NRl>/iNCR+l); 
NP1X11=IMTEGER((NSB*KI )/INCS + U; 
NPIX21=FNTIER( CNSE-KD/lNCS + n; 
GRNOC_MAT,ljUf«lB; 

CiRnDiMAT,!,?] isle; 


LBJ 

le; 

NSBl 

NSE; 


gRnD 

GRNO 

GRNO 

NCRO 


MAT, I, 31 1 =NSB; 
MAT, 1,43 I=NSe; 
t.MAT] |sNCARO_S; 


FOR NRECisNRFCl STEP 1 UNTIL NR EC2 DO 

FOR N P IX I .^iX t S tEP 1 U N T IL JiP I X2 DO 

[ ' E R R M A T : M A T , C L A S S [ N R E C , N P I X-I] 77^*^ 

I ERRMATEMAT , C L A S S I N R E C ^ N P I X - 1 II 1 ; 


fllR MAT 1=1 






jpFGlN 




FOP COL»=l STEP 1 UNTIL MATOT DO 



|sUMtHAT3* = SUMC«AT]+ERRMATtHAT#COUI 


IF SUMEMAT] GTR 0 

H 

... ... .. 

“then 

• — . ^ . . , ..... 

HHHI 



PERCORCmAT] I=CERRMATEMAT#MAT1/SUM[HAT] )X100 


» 

ELSEj 



i 

i 

SUMROHlsSUMROW+SUKtHATlI 

1 - 

WRITECLlt^Ejf l#U3J 


L_ 

end; 



.FllBL_COLt='l 

STEP 1 UNTIL HA TOT. DO. 


i 

RfGIn 




FOR MATlal STEP 1 UNTIL MATOT DO 



jTDTtCOLl *=T0TCC0L3+ERRMAT[MAT»CQLJ; | 

i 

j 

IF TOirCOL] GTR 0 | 

j 

^ LHFJU ^ ^ ^ 1 



PERCORCOLECOLI IsCERRMATt COL, COLT /TOT [CQL] >x 100 

.... . 

( 

t 

] 

IHHES 


. J 

I 

fnd; 'I 

WRITE CLINE 

#<//>); 

! 


»F3#L2)? 


FOR MAT«=1 

STEP 1 UNTIL MATOT ua. 

*• j 


TnTOIAG!«TOTOtAG+FRRMATt MAT^ HATI; 

, 


TOTSUMJalTQTDlAG/SUMROWlJCJlQO; 

-i 

[ 

( 

I WRrTECLlNF#F2» L3) J 

1 WRITEC0lSC4F2>L3)i 

? 

} 

L 

END OF TEST? 

J 









APPENDIX C 


ALGOL Listing for Procedure GAUSS 
Including Procedures 


CL AS SI 

CLASS2 

CHOLDETl 

CHOLSOLl 

CLASS3 

CLASS4 


If the procedure GAUSS is substitued for. the 
procedure POTENTIAL in GROUPL then a Gaussian 
Maximum Likelihood classification of the clusters 
is effected. 


PKOCt DURf:: GAUSSCKLASS^NCHANU#NCLUST>5IG/T0FILE^ CLASSINDEX# VALPDlNTf 

lNTEGE.fi NCLUbT?^^ % THE NUMBER OF CLUSTERS THAI HAVE BEEN 

% CREtAEED 

REAL AHKAT STgLO^O]; % AN AKRA^ CONTAINING THE AVERAGE SPECTRAL 

% SIGNATURES ASSOCIATED WITH EACH CLUSTERt 


TUi-lLE; 
KLASSi % 

NCHANUi X 

valpointt % 

CUASSINDEX; % 
ARRAY TRAN5PC03J % 


THE NUMBER OF CLASSES FOR WHICH GROUND 

number^of^IpectraPShannels on tape, 

A POINTER USED TO POINT THE BbGlNING 
OF VAUJ] IN matline. * 

A POINTER that POINTS TO THE CLASS 
NUMBER IN MATLINE. 

AN ARRAY CONTAINING THE CLASS NUMBER 


rO WHICH CLUSTER NUMBER 
BEEN ASSIGNED. 


begin 


FILF. IN TRAIN I) 1 SK "TRA I N’V"AX31 1 « ( 2^ 1 5 j 30 ) I 

LABEL fail; 

LABtL done; 

LABEL NFX7J 

FORMAT FMTlC/I6>H0)j 

integer array NUhTOjIA]; % AN ARRAY CONTAINING THE NUMBER OF SAMPLES 

% TRAINING THE POTENTIAL CLASSIFIER. 

INTEGER j; % AN INDEX VARIABLE CORRESPOND I Nr, TO THE 

% CHANNEL NUMBER, 

INTEGER NSi % AN INDEX VARIABLE CORRESPONDING TO THE 

5! THE CLUSTER NUMBER. 

INTEGER KL; % AN IN INDEX VARIABLE COKRESPUNdING TO fHE 

% CLASS NUMBER, 

INTEGER CL.rtSS; % AN INDEX VARIABLE CORRESPONDING TO THE 

% CLASS number. 

integer D?; % A PARAMETER USED TO COMPUTE THE 

% determinant FUNCTION. 

integer nc; % an index variable corresponding to the 

% channel number, 

integer NCl; % AN index variable corresponding td the 

% CHANNEL NUMBER. 

REAL GMAX; % THE MAXIMUM VALUE OF THE DESCRIMINANT 

% FUNCTION, 

real Di; X A NUMBER USED TO CALCULATE THE 

X determinant function, 

real G; % THL VALUE OF THE DESCRIMINANT FUNCTION, 

real DM; % A number USED TO CALCULATE THE 

% weighting FUNCTION, 

real q?; % a number used to calculate the 

% VALUE OF THE DESCRIMINANT FUNCTION, 

real 27} % A NUMBER USED TO CALCULATE THE 

% VALUE OF THE DESCRIMINANT FUNCTION, 

real ARRAY LT 0M2* 0 U 2 M % AN ARRA Y FOR STORING THE 

% COVARIANCE MATRIX ANDITS LOWER 

X TRIANGLEH TRANSFORMATION, 

real array SUMSQCO! J4#0M2/0 j I2i; % AN ARRAY FOR STORING THE 

% SUM OF THE PRDDECT5 VALCI3XVAL[J], 

REAL ARRAY MU [ 0 i 14# OM 2 T I % AN ARRAY FOR STOKING THE 

^ MEAN VECTOR EACH CLASS. 

REAL ARRAY MU1IUJ12]; % AN ARRAY FOR STORING THE MEAN 

% vector FOR THE ROWS OF MU. 

real array SUM[Oli4>Olll!]; % AN ARRAY FOR STORING THE 

■%. SUM OF VALLII, 

REAL ARRAY VAL[0M2]; % AN ARRAY FOR STORING THE - 

% SPECTRAL SIGNATURESOP EACH PIXELS, 

REAL ARRAY PT0*14#0n21# % AN ARRAY FOR STORING THE MAIN 

% DIAGONAL ELEMENTS OF THE LOWER 

% TRIANGULAR MATRIX, 

real % AN ARRAY CONTAINING THE WEIGHTING 

% FUNCTION OF EACH CLASS, 

REAL ARRAY Q t 0 S 1 4# 0 1 1 2 II 

REAL ARRAY 7tOM4#OJl2i; 


OEIGItfAi; PAGE ‘IS 
W POOK QUAEZra 


fhk klj-1 STLP 1 

BrGin ^ 


m 


NUMCKLJJsO; 

you NC:=1 step 1 UNTIL NCHANU DO 


UtGIN 

SUM[ KL^ NC ] »*0,0; 

FOR NClt=l STEP 1 UNTIL NCHANU DO 


BEGIN 


end; 


sumsqckl#ncjnci ] i=o; 

LC KL^ NC^NCn * “0; 


LNo; 


WHILE TRUE DO 


begin 


ENUf 


READ CTRAINj. 15^ MATLINEt*J ) [NEXT]; 


KL!-HATLINEtCUSSlMDEXi; 

FOR Jlsl STEP 1 U NTIL NCHANU UP 
I VAL[ J1 j=MATUN£CVALPOlNT-f J * U j 

CLASSKKL^ NCHANU^V AL^ NUMi SUM# SUMSGl ) ; 


CLASS2C NCHANU# HUM# SUM# SUMSQ# MU# UJ 


urKL# Ji; 


iffiywwy ■ “ 



EN DT 

WRITECLINEtPAGE] ); 

1*' K i TE CL I NE# <X 5# " NS?'# X 1 0#'*CLASS">) I 


Fan NSt-1 STEP 1 UNTIL NCLUST 00 

BEGIN- 

F UR STEP 1 UNTIL NCHANU DO 

vALi: siGr j#NSi; 

CLA5S»= i; ■ ■ ^ - - 

FOR KLt=l#?# 3#4#6 DO 



TRANSPCN S liaCLASS; 

W RI T E C H N L # r M T1#NS#CLASS ) ; 






Jv? Wi— PC? iW 


DCtUURE CIASSICKL^NCHANU# 
ntegeh kl; % 


NTfGtH NCHANU; % 

f.AL ARRAY VAlEO]; % 

NTEGLK array NU^tO] j % 


VAL#NIJM^SUM/ SUM5Q3J ^ , 

AN index variable CORRESPONDING TO THE 
CLASS NUMBERt , ^ 

AN INDEX variable CORRESPONDING TO THE 


CHANNEL NUMBER, 

AN ARRAY CONTAINING THE SPECTRAL 
SIGNATURES , , 

AN ARRAY CONTAINING the NUMBER OF SAMPLES 


FOR EACH class THAT ARE USED FOR 
TRAINING THE POTENTIAL CLASSIFIER. 


REAL array sum[0#oi; 




PRUCtUURE CLA 

integer 

X 

INTEgLH ARRftY 
% 

% 

REAL array 
REAL ARRAY 
REAL ARRAY 
REAL ARRAY 


SS2fNCHANU#NUM# 
NCHANU; % 

NUMCOII % 


LIST 

LIST 

FURMAT 

FORMAT 


hegii 


SKMt 0 > 03 i 

suMSfl[o>o>o 3 ; 

MULO/ 0 ]; 

Lro»o# 03 j 

L2CF0R NCt:il 
L3t for NC»=1 
FOR NCI 
FM12C//4E20i4 
FMT3C4E20*4]; 


SUM^ SUMSQ#HU#L); 

AN INDEX variable CDKRESROHD I NQ 
TO THE CHANNEL NUMBER. ^ ^ 

AN array containing THE NUMBER OF 
samples FOR EACH CLASS THAT ARE USED^ 
FOR TRAINING THE POTENTIAL CLASSIFIER. 


STEP 1 UNTIL NCHANU DO MUCKL^NC]); 

STEP 1 UNTIL NCHANU DO ^ ,,,, 

STEP I UNTIL NCHANU DO L t KL^ NC # NC 1 1) ; 
/)! 



FOR KL:“1 step 1 UNTIL KLASS PQ 

bFuTn 

FUR NCt=:l STEP 1 UNTIL NCHANU DO 

i MU[KL»NC3*g SUMlKLiNCJ/NUHCKLlj 

FUR NCtgj S TEP t UNTl L N C H_A N U ^ □ 

FUR NCn=NC STEP 1 U N T X L _N CHAN U DO 

1 L t K L^ NC > N C J T J = f 5 UM S Q I K L # N C a N C 1 3 / N U Mi K L ] ) - 

I 

WRITECLINE#FMT2^L2?; 

WRUECL1NEjFMT3>L33; 

ENu; - . - - . ■ _ 


end of CLASS2; 



PROCEDURE CHOl,DETUN,L>P#Ol>D?^ FAIU^ 


integer 

% 

RF.aL array 

REAL ARRAY 

real 

1N1LGEH 

% 

LARLL 
K 


N? 

L[U^O ,035 

PtU^O]) 

Di; 

D?; 


% AN INDEX variable CORRESPONDING TO 
THE channel number^ 


% A NUMBER USED TO CLACULATE THE 
DETERMINANT FUNCTION, 

% A number used TO CLACULATE THE 
DETERMINANT FUNCTION, 


FAILJ 

CHOLESKY DEFACTURIZATIUN TO PRODUCE L 
BEGIN 
INTEGER n 

INTEGER JJ 

INTEGER k; 

REAL v; 


Dllalf D2*sO; 

FUR 1:=1 STEL i UNTIL N DO 


FUR J:al STEP 1 UNTIL N_OQ 

BE GIN 

V* = UKL/I*xi]l 
FOR K:=I-t STEP -1 UNTIL 1 DO 


Vi-VLEKL# J^KlxLCKLf I^K3; 


M 


IF J EQL 
beTi N 


I THEN 


JQIIbSXUM.^., 

IF V EdTo THEN 

BEG I n"' 

U 2 *a 0 ; 

GO TO fail; 

FND 


m 


^ABSCDU GEQ I 01 


BEGIN 

D1 :sD1xO,06K5; 
D 2 ISD 2 + 4 ; 
,E.NO.L_ 


WHILE ABS(Dl) LSS 0,0625 DU 


BEGIN 

01 * = 01 x 161 

D2l=D2*4; 
end; / 


IF V LSS 0 THEN GU TO FAIL; 


PCKLM3* = 1,0/SQRTCV); 
] END 


ELSE LtKL# n * =VxPXKL^ n ; 


end; 

end of CHDLOETi; 


OP POOH V 



PRUCEQURE CHQLSEJLlCN>Lf B£X) J , 
INTEGER Ml % AN 

% , TD 

REAi. ARRAY l.CU^O^Ol? 

REAL ARRAY PCO^ 0]; 

REAL ARRAY R[0j; 

REAL ARRAY xrOfUJJ 


INDEX VARIABLE COKRESPDNDING 
THE CHANNEL NUMBER, 


% SOLUTION OF AXsB 

nEGTN 

INTEGER li 

INTEGER j; 

integlh k; 

' REAL . , 

% SOLUnON OF ly = bj 



FOR I!^l STEP 1 until N DO 


Bt OIN V»==8C I IT 

FOR Klsl-l STEP -i UNTIL 1 DO 

m=V>^l TKI , TrKTxXrKt # K 3 J ; 


^ end of cholsoli; | 


ORIGIKAIJPA^riE 

05* POOR QUALKE^ 






PROCEDURE CI.A5S3S 


i Utr ERMINE W 



begin 


W[MI:=0.0) 

FOR NCt=l STEP 1 UNTIL NCHANU DO 



DET :=9lx2*02i 

WtKL] :=-,5xWCKL]«.!>xLNCDET); 

wtKU »=h[ku+ln(approb[ku ); 


END J)F CLASS3; 





PFUICEDURE CI.ASS 4 ; 


t,LASSIF y 

BY G 

t.FuIN 


0 / !-u,o; 

Z 7 ;= 0 , 0 J 

FOK NC »=1 

STEP i until NCHANU DO 


I'tGlN 


end; 


QZ:=QZ+arKL^ NClxZtKL^NC]; 
ZZi = ZZ + Z[KL/MC3xZ[KL^NC] J 


laa a Mai Hor gt ni 


GMAK THEN 


HhGIN 


I t Nu; 


end of CLASS 4 J 


gmaxi=g; 

CLASSl=KLr 


I 




APPENDIX D 

ALGOL Listing of Procedure CHIMP 
Including Procedures 


DTJMPDATAROW 

DUMPDATA 

OUTFDTSUBLIST 

NEWCBOX 

NEWNODE 

INITIALIZATION 

SETTREE . 

INWINDOW 
INPUT 
DUMPTREE 
TREECLIMBER 
DISTSQ 
POTENTIAL 
DISCRIMINANT 
CLASSIFIEDCORRECTLY 
CHECKSUBLIST 
TREECHECRER 
CLASS IF?TRAIN 


If the procedure CHIMP is substituted for the procedure 
POTENTIAL in GRODPL then a hierarchical classification using 
the method of potentials is effected. 




<’«0ji[P!lfLXHlSS/l3iS*L?{E[l!r'^T6r?f??ViSs¥flEc85S?ggiu^ 

ncenters#sig#dutputarrav H 


'ngs^N, 

nlev# 

K§8P' 


NROHSI 

HINOOWSUE* 

A. « «■ A 


FILE TDFILEI 

SECOND* 

NCLUSTf 

mSxlevelJ 


NO or LEVELS OF CLASSIFICATION, 

!!Si|’oP°kE[ 8b§gi“o BE^MAiLKIth 

NO’OF TRAINING SAMPLES 

ISrg[illE5¥5l®¥§li?N5gG"SS?5i'’'‘ 

TlWf%§r«H¥cH”RlilNIN8 DATA IS READ. 
STARTING POSITION OF CLASS DATA IN FILE# 

lrS«lA?S*^o“^EltI8i!F?i 

NO, OF potential SFJl|?i7r|T?nN 
OEilRED level OF CLASSIFICATION* 


SiG COfOj# f%mm W 

OUTPUTARRAY E0#03? * 

E6IN X allocate GLOBAL DATA 
InV|g?§^ LINE PRINT C2» 15) 

ct^THEAD* 

TAVAILJ 

intege|°||ray 

TlAI^^tl^NLEV)/ 


ARRAY OF FEATURES OF^DATA TO BE CLASSIFIED 
FOR CLASSIFICATION RESULTS* 


COUN 


NK fllNROHS#llNLEVJ* 


cEli?LijlK..[li^RJ||§ 


TNODE tllNOOESi 

neSfeature* 


NSONS+UJ 


DEFI 
LABEL 


IIS?88E'H¥S881ji;i.ROHANi. 

N?“¥!lSEiS4?g£SSS;8f!!^IK^iHF> 


IPT* DONE! 








75d0 

7600 

7700 

7000 

7900 

7950 

BOOO 

8100 

8200 

8300 

8400 

8500 

8600 

8700 


%XXXX%XXXX%XXXtXXX%%XX%%XX%%X%%XXXXt%%%X%XXX%XX%XXXX%X%%%XXX%%X%X%X%X%X\ 

procedure: DUMPDATAROK CR0W*TAB3J 


INTEGER 
BEGIN 


ROH# TAB! 

INTEGER Ki NNCHANi NNLEVI 

nnchani«nchan; nnlfv t* nlevi 

WRITE C C I NE> 

NNCHAN<iFOR^KI»l STEP_l_UNTlL„N!i: HANARO FgATj/REC RpW#Kl7 


NNlEV 

HEIGHTtROWI^ 


*FDR K I»* 1 STEP 1 UNTIL NLEV 


in CLASSCR0H»K3^ 


”S8 


89 


XX 


9000 


NnCEV "^FDR KI **! STEP 1 UNTIL NLEV DO CLINKt ROW^KI )I 

xxxxxxxx^^xxt^xx^Mxhxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


•data; 


?I88 


9300 
9400 
9500 
9600 
9700 
9800 
9900 
ioooo 
10100 
10200 
10300 
10400 
10500 
10600 
10700 
10800 
10900 
11000 
11100 
11200 
11300 
11400 
11500 
11600 

il,'88 

11900 

120DO 

12100 

12200 

12300 

12400 

12500 

12600 

ilS88 

12900 

13000 

iH88 

13300 

13400 


^^JfilTi^lEfiL<)*fDUHPDATA»”>j; 

FOR K I»1 STEP 1 UNTIL NROHS DO DUMPDATARDHCKji 1 
END OF DUMPOATA; 


xxxxxxxxxxxxxxxxxxtxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxtxxtxxxxxxxxxxxxxxxxxxx 

PROCEDURE OUTPUTSUBLIST CP<LEVEL#KLASS)1 VALUE Pl 


INTEGER 

lIvel^ 

KLASS; 

BEGIN 

IF PkO then XRETURN 

mn 


END 
END OF 


while P>( 

®^*^SuHPDATAR 0H CPfLI 
Pl» CLINK CP^LEVi 

END 


;VEL3I 


dutputsublist; 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxtxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

INTEGER procedure NEKCBOXjf 


get 

THE 


A New data storage box, 

BOX AND CLEAR THE BOX., 


RETURN the LOCATION CROW NUMBER) 
ALSO ADJUST THE AVAIL STACK. 


OF 


BEGIN INTEGER NB#K| 
IF CAVAIL*0 


ELSE 


11188 


THEN BEGIN 

WRITE CLINE> 
<«***qverfldw data 
% generate divide 

CAVAILlwl/CAVAIU 

END 

begin 

NB. Ic CA 
FOR K«-l 
NEWCBOX m 


memdry***»'> ); 
e5?ception to halt exec 


ID 


13900 


'^88 


END OF 


nm 


NB 


1 


INK 

las 


CAVAIUl 

CNB.Kl+O; 


BOXl 


J4000 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

INTEGER procedure NEKNODEI 


1 14100 
14200 
14300 
14400 

1 14500 
14600 
14700 
14800 
■14900 
115000 

mm 

15300 
■ 15400 


this PROCEDURE RETURNS THE LOCATION 
BEEN CLEARED And READIED FDR USE 


OF A NEW NODE BOX WHICH HAS 


BEGIN 

IF 


integer K#NNf 

TAVAIL*0 THEN jj»jE,<„***dvERFL0H TREE MEMDRY**«>); 

TAVArL*»l/TAVAlL X GENERATF^DIV^^EXCE^ 

END ; X TD HALT EXECUTION 

NN ?* TAVAIL) 

NEHNODE I* NN) 

END OF NFWNODEI 


DO TN0DEI:NNVK3I»'0I 


104 


15500 

15600 

15700 

15S00 

15900 

16000 

16100 

16200 

16300 

16400 

16500 

16600 

16700 

16800 

16900 


4 vvr 

18900 

18950 


X%XXX%%%%XX%%%X%X%%%X%t%%%%^^t%%X%%%%%XXX%%Z%%%X%XXXXXXX%%XX%%X%%X%X%X 
PROCEDURE INITIALIZATIONS 


Initialize linked sTORAOEt 


BEGJ 


N integer K, 


! t»K + l s 
tK+i; 


POR KfSl STEP 1 UNIIL NROWS-l DO CLISTLINKCK3 tsK+l S 

FOR*Kli"l^STEP^{*UNTIL NCMAN*5o IlNDOHCKJ I bWINDOWSIZEJ 
END OF INITIALIZATION! 


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
PROCEDURE SETTREE (NEHCLASS)! 

I S^s^g§^g°8f?AT^I°TrLS?A^fISg rV5fPNS8E§^^^!!9“p*S?N?^fg^!rSP”* 


SU0LISTS OF "NEWCLASS",^^, . . 
INTEGER ARRAY NEKCLASSIllJ 


begin 


TROoPPo^THePtM l« NEWNODE! 

fqr\*?Pi step r until nlev do 

IF 0<NC l» NEWCLASStLI THEN 
RrfiTN 

Q l> TN0DEtP#l+NC5! ^ 

IF 0 » 0 THEN BElllN. 


TRAIULI I* aj 
P I* «l 

IPsE TRAIULTI»0J 
END OF SETTREE! 


Q nehhqdei 

TNODEtPM+NCI tf* 01 
END! 


’ll I gr?B^'^llTSSP?5"RS8 


real array NEHFEATUREt ni 
INTEGER. P! _ _ 


TCdTPPf^R Pi 

BEGIN boolean BJ INTEGER K! 

roPKlSi^STEP t 

ABS€FEATURECpJl^^ HINDOHCKI 

INHINDDH I« 81 
END OF IMWTNOOHJ 






21000 

li ^38 


PROCEDURE INPUT CNEWFEATURE#NEWCUA5S H 

INPUTS «NEHFEATURE« AND "NFWCLASS” TO LINKED STORAGE. 


}io8 X "nIwfIaIurI” Is fK THE®k!eNOQW of an I^ISTINS sample then it 

1500 X CLUSTERED WITH THAT SAMPLEl ELSE IT IS PLACED IN A NEW STORAG! 


2 U 00 

21500 

i\m 

21800 

21900 

22000 

imi 

22300 

22400 

22500 

22800 

22700 

22800 

22900 

23000 

23100 

23200 

23300 

23400 

23500 

23600 

um 


24000 

24100 

24200 

24300 

24400 

ISI88 


»7400 

!7500 


►8400 

!8500 

ilf 88 


INTEGER ARRAY NEWCLASStl3> 
REAL ARRAY NEMFEATURE^t H/ „ 
BEGIN INTEGER CMARK/» Tj> K*L^ P? 
SETTREE (NEWCLASS)J 

PoR^^LliNL?} STEP. -l_UNTI! 


label xit; 


POR LI*NLEV STEP -I UNTIL 1 DO 
IF 0 NEO TinTRAILCLl THEN 
RFG I N 

ir CMA.K > 0 THEN B«Jgi,KtCHARK.Ln.TNgDEtT.n, 

TN0DEET>13 CMARK 

END 

ELSE BEGIN 

not INHINDOH CNEWrEATURE^P) 

then p*«clinkcp#u 

ELSE 

BEGIN _ _ 


IF WETQHTCP1>HAXPDT then MAXPOT^'HEIGHTXPIl 
GO TO XIT 

END 

ENOi 

CLfNK£CMA?!l???^|i TN0DEET.1I; 

mJf'ilEVrailL 

FEATUREfCHARK.KI t* NEHFEATURECKlJ 
HEI6HTECMARK3 tn 11 

END 

XITI END^OpVlNPUTl 

PROCEDURE DUMPTREEI 

BEGIN INTEGER M.NJ • „ ' ■ 

WRITE CLINE# <«DUMPTREE»*>)I 
FOR Ml»l STEP 1 until nodes DO 
WRITE CLIME#<20ll>#M ^ * 

FOR Nf«t STEP 1 UNTIL NSONS +1 DO 
TNODE EM#N3) 1 
END OF DUMPTREEI ^ 

xxxxxxxxxxxxxxxxxxxxxtxxxxxxxxnxxxxxxxxxxxxxxxxpxxxxxxx^^ 

procedure TREECLIMBER (LOC#LtVEL#KLASS) I VALUE LOC.LEVEL.KLASSI 

X this PROC. TRAVERSES THE CLASSIFICATION TREE IN END ORDER, KHEN^A 
X NODE IS vfsTTED# THE CORRESPONDING SUBLlST IS OUTPUT (VIA «OUTPUTSUB‘ 
r i.!ST”) AND THEN EACH Of THE CHILDREN ARE VISITED. 

X ^*'°^INTEGER*ARRAV TNODE Cl 8 NODES# 1 8 NSONS II 


integer log# 

mm 


% location OF THE NODE 
X CLASSIFICATIaN ^OF.TH-,,. 
i level of THE node VTSITE 


I S ITED 
^SjjBLIST 


BEGIN 


NTEGER KaNLOCI 
F LOC « 0 THEN 


IF LOC « 0 
ELSE BEGjN 


X RETURN 


IF LEVEL NEQ 0 THEN aUTPUTSUBLISTCLOC#LEVEL#KLASS); 
FOR K l» I STEP 1 UNTIL NSONS., .DO 
IF CNLQCI«TN6DEtiOC#S^Kn > 0;TH 

TREECLIMBERCNLOC»LEVEU'i#K)l 


END Of THE PROCEDURE TREECLIMBER 1 


t %% XX % X %%%1 L 3 L %. X %%% X % XXXt %% X %%% t % t % XX % X % X % XXX %%%% XX %%%%%%% tX %%% i .%%% X %% XXX % 


X 

% 


iSioS 

urn 

?9«0O 
29900 
30000 
30100 
30200 
30300 
30000 
30500 
30600 
3oroo 
30800 
30900 
31000 

im 

31300 
31000 
31500 % 
31600 X 

1U88 I 
llo88 

32100 

32200 

32300 

32400 

32300 

32600 

SIroo 

32800 
32900 

33000 
33100 
33200 % 
33300 S 
33400 
33500 
33600 
33700 
33900 
33900 
34000 
34100 
34200 
36300 
34400 
34500 
34600 
34700 
34800 
34900 
35000 




COMPUTES THE 

xfeature and 


Square of the euclidean distance between 

THE VECTOR FEATUREtROW#*!. 


REAL ARRAY XFEATURE fill 
INTEGER ROHI 


integer KI 


BEGIN 

REAL SUM! 

SUM 4- 01 

’^'^^SUMisuS+’^CXFEATUREcSl*- ?eATURECR0H#K3 )*2I 
DISTSO t SUMI 
END OF DISTSQI 


%%%%x%xtx%%%xxxx%%%%%x%%%x%%%x%xx%x%%%%x%%x%%mx%%%%x%%%%%%%xxxx%%txx%% 
-^*!*ofENflAL*^fxFEATURE*ROWjL£VEL)l 
evaluates TH^ ^OtLOHJjG^PgjENTIAL^F^ 

HElGHTtROHlx- 


1 ♦ ALFA X fFEATURECROH#*] • XFEATUREC*3 ) * 

i 


REAL ARRAY XFEATURE f 1 3 
INTEGER ROW# LEVEL! 


begin 

POTENTIAL 


END OF POTENTIAL! 


^xCliuMlDAxioONTCROW^L 
/(14.ALFAXDISTSQ CXrEAT 


EVEL3) 

UREjROW))! 


REAL PROCEDURE DISCRIMINANT CXPEATUREAISTHEAD^LEVEL)! 

evaluates THE DtSCRIHlNANT FUNCTION FOR A SUBCLASS 
AT THE POINT XFEATURE, 

REAL ARRAY XFEATUREtllJ 
INTEGER LISTHEAO# 

‘ LEVEL! ■ ■ ■ ■ ■ 


P, 

|»0 


REAL SUM! 


hH?£e’p> 0 Do“rM|y|‘iHMu§H^ JpIatuSP”*'- 


Km . SUM , 

P f CLINKCPi.LEVELl! 

lyi^RiMlNANT 4- SUM ! 

END OF DISCRIMINANT! 



43801 

43900 

A4nnn 


nm 

X 

35500 X 
35600 

nm 

.888 
36100 
36200 

urn 

36500 

36600 

itm 

37200 

37300 

37400 

37500 

^7788 

37800 
37900 
3flOOG 
38100 
38200 
38300 
38400 
38500 
38600 

nm 

38900 
39000 
39100 
39200 
39 300 
39400 
39500 
39600 
39700 
39800 
39900 
40000 
40200 
40300 
40400 
40500 
40600 
4Q70Q 
40800 
40900 
41000 
411 

41300 

tm 

41600 
41700 
41800 
41900 

nm 

42200 
42300 
424 00 
42500 
42600 
«l700 
42800 
42900 
43000 
43200 
43 300 
43400 
4 3500 
43600 


BDO 


. .1. . t ■ ,1 ^ I - — 

l-E«gs???gg8[!lECTtY (CLOCjLEVEI#PARENT)I 


DETERMINES WHETHER DR NOT A TRAINING SAMPLE IS CLASSIFIED 
correctly by the present discriminant FUNCTIONS. 


ls^R^NtJ 


X A LTCATION IN CLIST 
X parent of this sublist 


that is the 


ARRAY XFEATURE CltNCHANI^ 

INTEGER ^ 

K-* Sj SIGCLASSI 

^^^felGVALUE/ d; , ^ . 

i!:!?ikT88?f?LD8?Kii 


X the feature vector to be 

X CLASSIFIED 




B 


'^F'g?8^A£EPI^^;^^t^R^M?NIi^f^XFEATURE, 

TNODEf TNODEEPARENT^ i+Sl# U/LEVEL) 
THEN BEGIN 

BIGVALUE«-Di BIGCLASS 4 .S I 

CLASSIFIEDc8rR^CTLY i‘ classecloc^level]«bigclass; 
END OF CLASSIFIEOCORRECTLYI 


BOOLEAN PROCEDURE CHECKSUBLlST CLlSTHEAO#LEVELjiPARENT 3 J 

TRAVERSES A SUBLlST TO DETERMINE WHETHER OR NOT ALL,TRAINING 
S^^ek?S,IN THE SUBLIST AR|_CDgRECTLY..CLAS§If IEO^B^J. THE 
DISCRIMINANT FUNCTinNS. 


classified^ Count is 
IMMEDIATE EFFECT DN 


. FOR EACH sample INCORRECTLY 

incrementeo immediately, this has 

THE DISCRIMINANT 


An 


FUNCTION, 


LXSTHEAO# X THE LOCATION OF THE SUBLIST^^ 

LEVELS X THE ‘LEVEL Of CLASS TO BE CHECKED ,;,,^ 

X THE LOCAnON OF THE PARENT TREF NODE 


PARENT! 

^^^RSQLEAN B J 

iSTluf? piLlSTHEAD! 


REAL POT ! 


X MOVE THROUGH THE LIST CHECKING EACH ELEMENT 


WHiLE’'P>0 p8’ 

NOT CLASSIFIEQCORRECTLY CP#LEVEL#PARENT3 
THEN BEGIN X INCREMENT COUNT 


P 4 . 
ENDI 


CL 


.OUN^fPiLEVELl^COUNTCP^LEyEU + il 
fNKlPjLEVELII X MOVE P DOWN LIST 


END'BP?§^8fei6k!s?S 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxsxxxxxxxxxxxxx 

boolean PROCEDURE TREECHECKERCL0C^LEVEL#PARENT3! 

X traverses all sublists to determine whether or not all SUBLISTS 

% are correctly CLASSIFIED BY THE DISCRIMINAT FUNCTIONS, 

INTEGER 
LOC/ 

BEGIN 

BOOLEAN B! 


X A TREE NODE LOCATION 
% 

X 


^HE^lSc’-^F^HE PARENT OF NODE AT LOC 


lEi 


SjiSON! 


END OF 


integer 

SE^BlbHECKSUBLllTCTNODECLOC^l I. LEVEL. PA RENT)! 
FOR S4-1 STEP 1 UNTIL NSONS DO 
IF tO<SON<*TNDDE[LOC. t + Sl ) 

then B4-0 AND rr 

TREECHECKER 4- B! 

TREECHECKEr 


‘R EEC H EC K E R (SON. L EV E L i # L 0 C 5 ! 




^;§83 

44400 

44500 

44600 

44700 

IS588 

45000 

45100 

45200 

45300 

45400 

45500 

45600 

45700 

45B00 

45900 

46000 

46050 

46100 

46200 

46300 

4640C 

46500 

?i?88 

46600 
46900 
47000 
47100 
47200 
47300 
47400 
4750 0 
47600 
47650 
47660 
47700 
47900 
47900 
48000 
48100 
46200 
48300 

48600 

48700 

48800 

49000 

49100 

49150 

11188 

49350 

49351 
49360 
49400 
49500 
49600 ! 
49700 
49800 ! 
49900 
50000 
50100 
50200 
50300 
50400 
50490 [ 
50495 
50500 
50530 
50600 
50700 
lOBOO 
50900 
5100G 
51100 
51200 
51300 
51400 


CLASSiriCS "NEHFtATuRE:" LEVEl-BY^LEVEL CUP tO^MAXLCVEL 5 j 


URE^NEWCLASS 

^^^XNTEGER LEVEL^PjBIOClASSf K# j; 
REAL BIGVALUE^O I 

MAXLEVEL,<* HINCHAXLEVEL# NLEV)I 
P ♦ TROOT; 

LEVEL t“ OJ 

WKILE P>0 and LEVEL4MAXLEVEL 00 


% nehfeature is classified 
X result in newclass 


BEGIg 


;i LEVEL +^i I 
JIGVALUE In 01 

Kti Step i unti 


FOR K<-1 §TEP 1 UNTIL N 
IF tnode!p#i+k3>o TREN 


filGCLASS tM O; 
L NSONS DO 


BlGVALUE<D*OISgEI^INJNJ^N|Wg^JTyR^,^ 

T^^/^ ®^S{8vALUE<' Dl BIGCLASS^- Kl 
end; 

fr'^llGVALUE GEO THRESHOLD 

THEN ®^®J8wclassclevelh bigclass ; 

P4. TNODECP^l + BIGCLASS}; 

END 

else begin . , 

NEWCLASStLEVFLI*- "I f P<- 0 ^ 

FOR JI-L^EL + l STEP 1 UNTIL NLEV DO 
NEWCiASStJ3l«0; 

end; 


ENO OF classify; 


PROCEDURE TRAIN ; 


:eCUTE 5 "TREECHECKER” AT MOST 20 TIMES OR UNTIL^ ALL TRAINING 
iIA Are CLASSIFIED CORRECTLY BY THE DISCRIMINANT FUNCTIONS. 


^^^^NTEGER,I; 

PoR ST^P 
BEGIN 


boolean ok; 

1 WHILE K20 and NOT OK DO 


IPT* 


DONE* 


WRlTE?LfNE^<®fRn5!NQ WAS^^fLS^IlO^" PASSES USEO"># OK# DJ 

WRITECLINE#<»ELAPSED TIME! PR#10">2R15,4>. 
TIMEC2)/60^TlMEC3)/60); 

■ end; 

END OF TRAIN! 

CHIMP EXECUTION 

i RlSfD^cfyftEJF^RST ♦NCHAN^NLEV# DUMMYC * 3) CD0NE3I 
FDR K«-l STEP 1 UNTIL NLEV 00^' 

NEWCLASSCk! 4- dummy tFZRST+K"^l 3; 

FOR K4-1 STEP 1 UNTIL NCHAN DO „ , , 

NEWFEATURE tK3 DUMMY t SECDNO + K-1 3 ! 

INPUT CNEHFEATURE#NEWCLASS5! 

E» ^°NcPnIeRS l* CAVAIL> U ^ Ha 

HRITECLlNE^<«NCENTERS*ti,l6>,NCENTERS3; JVVio To 

IF Debug then dumpdata; 

FOR^kSo step 1 UNTIL NCLUST •! DO 7 

®^^FOR STEP 1 UNTIL NCHAN DD 
NEWFEATURE tM3 SIG EM#K3; 

CLASSIFY (MAXLEVED; 

FOR M«-i STEP i UNTIL NLtV DO , , 

OUTPUTARRAY CK#H3 <*• NEWCLASS IM3; 

- end; 

END OF chimp; 


